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This invention relates to newly identified breast, ovarian, breast cancer and/or ovarian cancer related polynucleotides and the 
polypeptides encoded by these polynucleotides herein collectively known as "breast/ovarian cancer antigens", and to the complete gene 
sequences associated therewith and to the expression products thereof, as well as the use of such breast/ovarian cancer antigens for detection, 
prevention and treatment of disorders of the female reproductive system, particularly disorders of the breast and/or ovary, including the 
presence of breast cancer and/or ovarian cancer. This invention relates to the breast/ovarian cancer antigens as well as vectors, host cells, 
antibodies directed to breast/ovarian cancer antigens and recombinant and synthetic methods for producing the same. Also provided are 
diagnostic methods for diagnosing and treating, preventing and/or prognosing disorders related to the female reproductive system, particularly 
disorders of the breast and/or ovary, including breast cancer and/or ovarian cancer, and therapeutic methods for treating such disorders. The 
invention further relates to screening methods for identifying agonists and antagonists of breast/ovarian cancer antigens of the invention. 
The present invention further relates to methods and/or compositions for inhibiting the production and/or function of the polypeptides of 
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Honmaini Breast amid Ovariami Camicer Associated 
Gene Sequences and Polypeptides 

5 Field of the Invention 

This invention relates to newly identified breast, ovarian, breast cancer, and ovarian 
cancer related polynucleotides and the polypeptides encoded by these polynucleotides herein 
collectively known as "breast/ovarian cancer antigens," and to the complete gene sequences 
associated therewith and to the expression products thereof, as well as the use of such 

10 breast/ovarian cancer antigens for detection, prevention and treatment of disorders of the 
female reproductive system, specifically disorders of the breast or ovary, particularly the 
presence of breast and/or ovarian cancer. This invention relates to the breast/ovarian cancer 
antigens as well as vectors, host cells, antibodies directed to breast/ovarian cancer antigens 
and recombinant and synthetic methods for producing the same. Also provided are 

15 diagnostic methods for diagnosing and treating, preventing and/or prognosing disorders 
related to the female reproductive system, specifically disorders of the breast and/or ovary, 
including breast cancer and/or ovarian cancer, and therapeutic methods for treating such 
disorders. The invention further relates to screening methods for identifying agonists and 
antagonists of breast/ovarian cancer antigens of the invention. The present invention further 

20 relates to methods and/or compositions for inhibiting the production and/or function of the 
polypeptides of the present invention. 

Background of the Invention 

Breast cancer represents the most frequent caiise of early morbidity and mortality in 

25 women in North America (Harris et al. New Eng. J. Med. 327:319, 390 and 473 (1992)). It is 
generally believed that this malignancy arises from a multi step process involving mutations 
in a relatively small number of genes, perhaps 10 or less. These mutations result in significant 
changes in the growth and differentiation of breast tissue that allow it to grow independent of 
normal cellular controls, to metastasize, and to escape immune surveillance. The genetic 

30 heterogeneity of most breast cancers suggests that they arise by a variety of initiating events 
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and that the characteristics of individual cancers are due to the collective pattern of genetic 
changes that accumulate (Harris et al. New Eng. J. Med. 327:319, 390 and 473 (1992)). 

The classes of genes that are involved in breast cancer are not unlike those found in a 
number of other well characterized malignancies, although some are highly specific for breast 
5 cancer. In particular, mutations in the genes that encode receptors involved in binding to 
estrogen and progesterone are particularly important because they likely cause the breast cells 
to proliferate while rendering them unresponsive to the antitumor effects of these hormones 
in advanced malignancy. In addition, changes in the genes that encode growth factors, other 
receptors, signal transduction molecules, and transcription factor molecules are frequently 

10 involved and have alterations that are involved in the development and progression of breast 
cancer (King, Nature Genetics 2:125 (1992)). The characterization of the type and number of 
mutations seen in individual breast cancers is useful in classifying the biological properties of 
individual cancers and in determining the prognosis for individual patients. For example, the 
erbB2/HER2/neu gene is particularly valuable in predicting the prognosis of both node- 

15 positive and node-negative patients based on the amplification status of the gene (King, 
Science 250:1684 (1990)). Several additional members of this family have been discovered 
but the ligand for erbB2/HER2/neu remains unknown. It is anticipated that further advances 
in therapeutics will be achieved by the development of therapies that disrupt aberrant growth 
signaling pathways or affect the cellular interactions of breast cancer cells with native stroma 

20 or metastatic sites. 

Although oncogenes are likely to be very important in breast cancer, tumor suppressor 
genes may also play an important role. Certain of these genes, including p53 and Rb-1, are 
essential to the normal mechanisms that control cell cycle events, especially those 
checkpoints at the border of the different stages of the cell cycle (Hollstein et al. Science 

25 253:49 (1991); Srivastava et al. Nature 348:747 (1990)). 

In 1969, Li and Fraumeni documented a familial cancer syndrome that had an 
autosomal dominant pattern of expression (Li et al, Ann. Intern. Med. 71:747 (1969)). 
Members of these families had sarcomas, breast cancers, brain tumors, leukemias, 
adrenocortical carcinomas, and other malignancies. Family studies demonstrated that the 

30 gene responsible for the syndrome was located on chromosome 17, and examination of the 
p53 gene as a candidate gene revealed that this gene was mutated in five families (Malsin et 
al. Science 250:1233 (1990)). In the last two years, two genes linked to familial breast cancer, 



wo 00/55173 



PCT/USOO/05881 



designated BRCAl and BRCA2, have been isolated and characterized. BRCAl is at 17q21 
(Claus et al, Am. J. Epidemiology 131:961 (1990); Hall et al, Science 250:1684 (1990); 
Easton et al. Am. J. of Human Genetics 52 (4):678 (1993); Black et al, Am. J. of Human 
Genetics 52 (4):702 (1993); Bowcock et ah Am. J. of Human Genetics 52 (4):718 (1993); 
5 Miki et al. Science 266:66 (1995)). The demonstration of loss of heterozygosity (LOH) at 
17q25 has defined another potential tumor suppressor gene (Lindblom et al, Human Genetics 
91:6 (1993); CorneUs et al, Oncogene 8:781 (1993); Theile et al, Oncogene 10:439 (1995)). 

There is a need, therefore, for identification and characterization of such factors that 
modulate activation and differentiation of breast and ovarian cells, both normally and in 

10 disease states. In particular, there is a need to isolate and characterize additional molecules 
that mediate apoptosis, DNA repair, tumor-mediated angiogenesis, genetic imprinting, 
immune responses to tumors and tumor antigens and, among other things, that can play a role 
in detecting, preventing, ameliorating or correcting dysfunctions or diseases. 

The present invention relates at least in part, to a novel breast and ovarian and breast 

15 and ovarian cancer related polynucleotides and polypeptides. The discovery of these breast 
and ovarian cancer related polynucleotides provides new compositions v/hich are useful in 
the diagnosis, prevention and treatment of disorders of the female reproductive system, 
particularly of the ovary including, but not limited to ovarian cancer, and the breast, including 
but not limited to breast cancer. 

20 

Summary of the Invention 

The present invention includes isolated nucleic acid molecules comprising, or 
alternatively, consisting of, a breast, ovarian, breast cancer and/or ovarian cancer associated 

25 polynucleotide sequence disclosed in the sequence listing (as SEQ ID Nos:l to 418) and/or 
contained in a human cDNA clone described in Tables 1, 2 and 5 and deposited with the 
American Type Culture Collection ("ATCC"). Fragments, variant, and derivatives of these 
nucleic acid molecules are also encompassed by the invention. The present invention also 
includes isolated nucleic acid molecules comprising, or alternatively consisting of, a 

30 polynucleotide encoding a breast, ovarian, breast cancer, and/or ovarian cancer polypeptide. 
The present invention ftirther includes breast, ovarian, breast cancer, and/or ovarian cancer 
polypeptides encoded by these polynucleotides. Further provided for are amino acid 
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4 

sequences comprising, or alternatively consisting of, breast, ovarian, breast cancer, and/or 
ovarian cancer polypeptides as disclosed in the sequence listing (as SEQ ID Nos: 419 to 836) 
and/or encoded by a human cDNA clone described in Tables 1, 2 and 5 and deposited with 
the ATCC. Antibodies that bind these polypeptides are also encompassed by the invention. 
5 Polypeptide fragments, variants, and derivatives of these amino acid sequences are also 
encompassed by the invention, as are polynucleotides encoding these polypeptides and 
antibodies that bind these polypeptides. Also provided are diagnostic methods for diagnosing 
and treating, preventing, and/or prognosing disorders related to the female reproductive 
system, specifically disorders related to the breast and/or ovary, including breast cancer 
10 and/or ovarian cancer, and therapeutic methods for treating such disorders. The invention 
further relates to screening methods for identifying agonists and antagonists of breast/ovarian 
cancer antigens of the invention. 

Detailed Description 

15 

Tables 

Table 1 summarizes some of the breast/ovarian cancer antigens encompassed by the 
invention (including contig sequences (SEQ ID NO:X) and the cDNA clone related to the 
contig sequence) and further summarizes certain characteristics of the breast/ovarian cancer 

20 polynucleotides and the polypeptides encoded thereby. The first column shows the "SEQ ID 
NO:" for each of the 418 breast/ovarian cancer antigen polynucleotide sequences of the 
invention. The second column provides a unique "Sequence/Contig ID" identification for 
each breast, ovarian, breast cancer and/or ovarian cancer associated sequence. The third 
column, "Gene Name," and the fourth column, "Overlap," provide a putative identification 

25 of the gene based on the sequence similarity of its translation product to an amino acid 
sequence found in a publicly accessible gene database and the database accession no. for the 
database sequence having similarity, respectively. The fifth and sixth columns provide the 
location (nucleotide position nos. within the contig), "Start" and "End", in the polynucleotide 
sequence "SEQ ID NO:X" that delineate the preferred ORF shown in the sequence listing as 

30 SEQ ID NO:Y. The seventh and eighth columns provide the "% Identity" (percent identity) 
and "% Similarity" (percent similarity), respectively, observed between the aligned sequence 
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segments of the translation product of SEQ ID NO:X and the database sequence. The ninth 
column provides a unique "Clone ID" for a cDNA clone related to each contig sequence. 

Table 2 summarizes ATCC Deposits, Deposit dates, and ATCC designation numbers 
of deposits made with the ATCC in connection with the present application. 
5 Table 3 indicates public ESTs, of which at least one, two, three, four, five, ten, fifteen 

or more of any one or more of these public EST sequences are optionally excluded from 
certain embodiments of the invention. 

Table 4 lists residues comprising antigenic epitopes of antigenic epitope-bearing 
fragments present in most of the breast, ovarian, breast cancer or ovarian cancer associated 
10 polynucleotides described in Table 1 as predicted by the inventors using the algorithm of 
Jameson and Wolf, (1988) Comp. Appl. Biosci. 4:181-186. The Jameson- Wolf antigenic 
analysis was performed using the computer program PROTEAN (Version 3.1 1 for the Power 
Macintosh, DNASTAR, Inc., 1228 South Park Street Madison, WI). Breast, ovarian, breast 
cancer and/or ovarian cancer associated polypeptides (e.g., SEQ ID NO:Y, polypeptides 
15 encoded by SEQ ID NO:X, or polypeptides encoded by the cDNA in the referenced cDNA 
clone) may possess one or more antigenic epitopes comprising residues described in Table 4. 
It will be appreciated that depending on the analytical criteria used to predict antigenic 
determinants, the exact address of the determinant may vary slightly. The residues and 
locations shown in column two of Table 4 correspond to the amino acid sequences for most 
20 breast, ovarian, breast cancer and/or ovarian cancer associated polypeptide sequence shown 
in the Sequence Listing. 

Table 5 shows the cDNA libraries sequenced, and ATCC designation numbers and 
vector information relating to these cDNA libraries. 

25 

Definitions 

The following definitions are provided to facilitate understanding of certain terms 
used throughout this specification. 

In the present invention, "isolated" refers to material removed from its original 
30 environment (e.g., the natural environment if it is naturally occurring), and thus is altered "by 
the hand of man" from its natural state. For example, an isolated polynucleotide could be part 
of a vector or a composition of matter, or could be contained within a cell, and still be 
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6, 

"isolated" because that vector, composition of matter, or particular cell is not the original 
environment of the polynucleotide. The term 'Isolated" does not refer to genomic or cDNA 
libraries, whole cell total or mRNA preparations, genomic DNA preparations (including 
those separated by electrophoresis and transferred onto blots), sheared whole cell genomic 
5 DNA preparations or other compositions where the art demonstrates no distinguishing 
features of the polynucleotide/sequences of the present invention. 

As used herein, a "polynucleotide" refers to a molecule having a nucleic acid 
sequence contained in SEQ ID NO:X (as described in column 1 of Table 1) or the related 
cDNA clone (as described in column 9 of Table 1 and contained within a library deposited 

10 with the ATCC). For example, the polynucleotide can contain the nucleotide sequence of the 
full length cDNA sequence, including the 5' and 3' untranslated sequences, the coding region, 
as well as fragments, epitopes, domains, and variants of the nucleic acid sequence. 
Moreover, as used herein, a "polypeptide" refers to a molecule having an amino acid 
sequence encoded by a polynucleotide of the invention as broadly defined (obviously 

15 excluding poly-Phenylalanine or poly-Lysine peptide sequences which result from translation 
of a polyA tail of a sequence corresponding to a cDNA). 

In the present invention, "SEQ ID NO:X" was often generated by overlapping 
sequences contained in multiple clones (contig analysis). A representative clone containing 
all or most of the sequence for SEQ ID NO:X is deposited at Human Genome Sciences, Inc. 

20 (HGS) in a catalogued and archived library. As shown in column 9 of Table 1, each clone is 
identified by a cDNA Clone ID. Each Clone ID is unique to an individual clone and the 
Clone ID is all the information needed to retrieve a given clone from the HGS library. In 
addition to the individual cDNA clone deposits, most of the cDNA libraries from which the 
clones were derived were deposited at the American Type Culture Collection (hereinafter 

25 "ATCC"). Table 5 provides a list of the deposited cDNA libraries. One can use the Clone ID 
to determine the library source by reference to Tables 2 and 5. Table 5 lists the deposited 
cDNA libraries by name and links each library to an ATCC Deposit. Library names contain 
four characters, for example, "HTWE." The name of a cDNA clone ("Clone ID") isolated 
from that library begins with the same four characters, for example "HTWEP07*'. As 

30 mentioned below, Table 1 correlates the Clone ID names with SEQ ID NOs. Thus, starting 
with a SEQ ID NO, one can use Tables 1 , 2 and 5 to determine the corresponding Clone ID, 
from which library it came and in which ATCC deposit the library is contained. Furthermore, 
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it is possible to retrieve a given cDNA clone from the source library by techniques known in 
the art and described elsewhere herein. The ATCC is located at 10801 University Boulevard, 
Manassas, Virginia 201 10-2209, USA. The ATCC deposits were made persuant to the terms 
of the Budapest Treaty on the international recognition of the deposit of microorganisms for 
5 the purposes of patent procedure. 

A "polynucleotide" of the present invention also includes those polynucleotides 
capable of hybridizing, under stringent hybridization conditions, to sequences contained in 
SEQ ID NO:X, or the complement thereof (e.g., the complement of any one, two, three, four, 
or more of the polynucleotide fragments described herein), and/or sequences contained in the 

10 related cDNA clone within a library deposited with the ATCC. "Stringent hybridization 
conditions" refers to an overnight incubation at 42 degree C in a solution comprising 50% 
formamide, 5x SSC (750 mM NaCl, 75 mM trisodium citrate), 50 mM sodium phosphate (pH 
7,6), 5x Denhardt's solution, 10% dextran sulfate, and 20 |ig/ml denatured, sheared salmon 
sperm DNA, followed by washing the filters in O.lx SSC at about 65 degree C. 

15 Also included within '^polynucleotides" of the present invention are nucleic acid 

molecules that hybridize to the polynucleotides of the present invention at lower stringency 
hybridization conditions. Changes in the stringency of hybridization and signal detection are 
primarily accomplished through the manipulation of formamide concentration (lower 
percentages of formamide result in lowered stringency); salt conditions, or temperature. For 

20 example, lower stringency conditions include an overnight incubation at 37 degree C in a 
solution comprising 6X SSPE (20X SSPE = 3M NaCl; 0.2M NaH2P04; 0.02M EDTA, pH 
7.4), 0.5% SDS, 30% formamide, 100 ug/ml salmon sperm blocking DNA; followed by 
washes at 50 degree C with IXSSPE, 0.1% SDS. In addition, to achieve even lower 
stringency, washes performed following stringent hybridization can be done at higher salt 

25 concentrations (e.g. 5X SSC). 

Note that variations in the above conditions may be accomplished through the 
inclusion and/or substitution of alternate blocking reagents used to suppress background in 
hybridization experiments. Typical blocking reagents include Denhardt's reagent, BLOTTO, 
heparin, denatured salmon sperm DNA, and commercially available proprietary formulations. 

30 The inclusion of specific blocking reagents may require modification of the hybridization 
conditions described above, due to problems with compatibihty. 
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Of course, a polynucleotide which hybridizes only to polyA+ sequences (such as any 
3' terminal polyA+ tract of a cDNA shown in the sequence listing), or to a complementary 
stretch of T (or U) residues, would not be included in the definition of "polynucleotide," since 
such a polynucleotide would hybridize to any nucleic acid molecule containing a poly (A) 
5 stretch or the complement thereof (e.g., practically any double-stranded cDNA clone 
generated using oligo dT as a primer). 

The polynucleotides of the present invention can be composed of any 
polyribonucleotide or polydeoxribonucleotide, which may be unmodified RNA or DNA or 
modified RNA or DNA. For example, polynucleotides can be composed of single- and 

10 double-stranded DNA, DNA that is a mixture of single- and double-stranded regions, single- 
and double-stranded RNA, and RNA that is mixture of single- and double-stranded regions, 
hybrid molecules comprising DNA and RNA that may be single-stranded or, more typically, 
double-stranded or a mixture of single- and double-stranded regions. In addition, the 
polynucleotide can be composed of triple-stranded regions comprising RNA or DNA or both 

15 RNA and DNA. A polynucleotide may also contain one or more modified bases or DNA or 
RNA backbones modified for stability or for other reasons. "Modified" bases include, for 
example, tritylated bases and unusual bases such as inosine. A variety of modifications can 
be made to DNA and RNA; thus, "polynucleotide" embraces chemically, enzymatically, or 
metabolically modified forms. 

20 In specific embodiments, the polynucleotides of the invention are at least 15, at least 

30, at least 50, at least 100, at least 125, at least 500, or at least 1000 continuous nucleotides 
but are less than or equal to 300 kb, 200 kb, 100 kb, 50 kb, 15 kb, 10 kb, 7.5kb, 5 kb, 2.5 kb, 
2.0 kb, or I kb, in length. In a further embodiment, polynucleotides of the invention 
comprise a portion of the coding sequences, as disclosed herein, but do not comprise all or a 

25 portion of any intron. In another embodiment, the polynucleotides comprising coding 
sequences do not contain coding sequences of a genomic flanking gene (i.e., 5' or 3' to the 
gene of interest in the genome). In other embodiments, the polynucleotides of the invention 
do not contain the coding sequence of more than 1000, 500, 250, 100, 50, 25, 20, 15, 10, 5, 4, 
3, 2, or 1 genomic flanking gene(s). 

30 "SEQ ID NO:X" refers to a breast/ovarian cancer antigen polynucleotide sequence 

described in Table 1. SEQ ID NO:X is identified by an integer specified in column I of Table 
1. The polypeptide sequence SEQ ID NO:Y is a translated open reading frame (ORF) 
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encoded by polynucleotide SEQ ID NO:X. There are 418 breast/ovarian cancer antigen 
polynucleotide sequences described in Table 1 and shown in the sequence listing (SEQ ID 
N0:1 through SEQ ID NO:418). Likewise there are 418 polypeptide sequences shown in the 
sequence listing, one polypeptide sequence for each of the polynucleotide sequences (SEQ ID 
5 NO:419 through SEQ ID NO:836). The polynucleotide sequences are shown in the sequence 
listing immediately followed by all of the polypeptide sequences. Thus, a polypeptide 
sequence corresponding to polynucleotide sequence SEQ ID NO.T is the first polypeptide 
sequence shown in the sequence listing. The second polypeptide sequence corresponds to the 
polynucleotide sequence shown as SEQ ID N0:2, and so on. In otherwords, since there are 

10 418 polynucleotide sequences, for any polynucleotide sequence SEQ ID NO:X, a 
conesponding polypeptide SEQ ID NO;Y can be determined by the formula X + 418 = Y. In 
addition, any of the unique "Sequence/Contig ID" defined in column 2 of Table I, can be 
linked to the corresponding polypeptide SEQ ID NO:Y by reference to Table 4. 

The polypeptides of the present invention can be composed of amino acids joined to 

15 each other by peptide bonds or modified peptide bonds, i.e., peptide isosteres, and may 
contain amino acids other than the 20 gene-encoded amino acids. The polypeptides may be 
modified by either natural processes, such as posttranslational processing, or by chemical 
modification techniques which are well known in the art. Such modifications are well 
described in basic texts and in more detailed monographs, as well as in a voluminous research 

20 literature. Modifications can occur anywhere in a polypeptide, including the peptide 
backbone, the amino acid side-chains and the amino or carboxyl termini. It will be 
appreciated that the same type of modification may be present in the same or varying degrees 
at several sites in a given polypeptide. Also, a given polypeptide may contain many types of 
modifications. Polypeptides may be branched, for example, as a result of ubiquitination, and 

25 they may be cyclic, with or without branching. Cyclic, branched, and branched cyclic 
polypeptides may result from posttransiation natural processes or may be made by synthetic 
methods. Modifications include acetylation, acylation, ADP-ribosylation, amidation, 
covalent attachment of flavin, covalent attachment of a heme moiety, covalent attachment of 
a nucleotide or nucleotide derivative, covalent attachment of a lipid or lipid derivative, 

30 covalent attachment of phosphotidylinositol, cross-linking, cyclization, disulfide bond 
formation, demethylation, formation of covalent cross-links, formation of cysteine, formation 
of pyroglutamate, formylation, gamma-carboxylation, glycosylation, GPI anchor formation. 
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10 

hydroxylation, iodination, methylation, myristoylation, oxidation, pegylation, proteolytic 
processing, phosphorylation, prenylation, racemization, selenoylation, sulfation, transfer- 
RNA mediated addition of amino acids to proteins such as arginylation, and ubiquitination. 
(See, for instance, PROTEINS - STRUCTURE AND MOLECULAR PROPERTIES, 2nd 
5 Ed., T. E. Creighton, W. H. Freeman and Company, New York (1993); 
POSTTRANSLATIONALCOVALENT MODIFICATION OF PROTEINS, B. C. Johnson, 
Ed., Academic Press, New York, pgs. 1-12 (1983); Seifter et aL, Meth Enzymol 182:626-646 
(1990); Rattan et al„ Ann NY Acad Sci 663:48-62 (1992).) 

The breast, ovarian, breast cancer and/or ovarian cancer polypeptides of the invention 

10 can be prepared in any suitable manner. Such polypeptides include isolated naturally 
occurring polypeptides, recombinantly produced polypeptides, synthetically produced 
polypeptides, or polypeptides produced by a combination of these methods. Means for 
preparing such polypeptides are well understood in the art. 

The polypeptides may be in the form of the secreted protein, including the mature 

15 form, or may be a part of a larger protein, such as a fusion protein (see below). It is often 
advantageous to include an additional amino acid sequence which contains secretory or 
leader sequences, pro-sequences, sequences which aid in purification, such as multiple 
histidine residues, or an additional sequence for stability during recombinant production. 

The breast, ovarian, breast cancer and/or ovarian cancer polypeptides of the present 

20 invention are preferably provided in an isolated form, and preferably are substantially 
purified. A recombinantly produced version of a polypeptide, including the secreted 
polypeptide, can be substantially purified using techniques described herein or otherwise 
known in the art, such as, for example, by the one-step method described in Smith and 
Johnson, Gene 67:31-40 (1988). Polypeptides of the invention also can be purified from 

25 natural, synthetic or recombinant sources using techniques described herein or otherwise 
known in the art, such as, for example, antibodies of the invention raised against the 
polypeptides of the present invention in methods which are well known in the art. 

By a polypeptide demonstrating a "functional activity" is meant, a polypeptide 
capable of displaying one or more known functional activities associated with a full-length 

30 (complete) protein of the invention. Such functional activities include, but are not limited to, 
biological activity, antigenicity [ability to bind (or compete with a polypeptide for binding) 
to an anti-polypeptide antibody], immunogenicity (ability to generate antibody which binds to 
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a specific polypeptide of the invention), ability to form multimers with polypeptides of the 
invention, and ability to bind to a receptor or ligand for a polypeptide. 

"A polypeptide having functional activity" refers to polypeptides exhibiting activity 
similar, but not necessarily identical to, an activity of a polypeptide of the present invention, 
5 including mature forms, as measured in a particular assay, such as, for example, a biological 
assay, with or without dose dependency. In the case where dose dependency does exist, it 
need not be identical to that of the polypeptide, but rather substantially similar to the dose- 
dependence in a given activity as compared to the polypeptide of the present invention (i.e., 
the candidate polypeptide will exhibit greater activity or not more than about 25-fold less 

10 and, preferably, not more than about tenfold less activity, and most preferably, not more than 
about three-fold less activity relative to the polypeptide of the present invention). 

The functional activity of the breast/ovarian cancer antigen polypeptides, and 
fragments, variants derivatives, and analogs thereof, can be assayed by various methods. 

For example, in one embodiment where one is assaying for the ability to bind or 

15 compete with full-length polypeptide of the present invention for binding to an antibody to 
the full length polypeptide antibody, various immunoassays known in the art can be used, 
including but not limited to, competitive and non-competitive assay systems using techniques 
such as radioimmunoassays, ELISA (enzyme linked immunosorbent assay), "sandwich" 
immunoassays, immunoradiometric assays, gel diffusion precipitation reactions, 

20 immunodiffusion assays, in situ immunoassays (using colloidal gold, enzyme or radioisotope 
labels, for example), western blots, precipitation reactions, agglutination assays (e.g., gel 
agglutination assays, hemagglutination assays), complement fixation assays, 
immunofluorescence assays, protein A assays, and immunoelectrophoresis assays, etc. In one 
embodiment, antibody binding is detected by detecting a label on the primary antibody. In 

25 another embodiment, the primary antibody is detected by detecting binding of a secondary 
antibody or reagent to the primary antibody. In a further embodiment, the secondary 
antibody is labeled. Many means are known in the art for detecting binding in an 
immunoassay and are within the scope of the present invention. 

In another embodiment, where a ligand is identified, or the ability of a polypeptide 

30 fragment, variant or derivative of the invention to multimerize is being evaluated, binding can 
be assayed, e.g., by means well-known in the art, such as, for example, reducing and non- 
reducing gel chromatography, protein affinity chromatography, and affinity blotting. See 
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generally, Phizicky, E., et al., Microbiol. Rev. 59:94-123 (1995). In another embodiment, 
physiological coirelates polypeptide of the present invention binding to its substrates (signal 
transduction) can be assayed. 

In addition, assays described herein (see Examples) and otherwise known in the art 
5 may routinely be applied to measure the ability of polypeptides of the present invention and 
fragments, variants derivatives and analogs thereof to elicit polypeptide related biological 
activity (either in vitro or in vivo). Other methods will be known to the skilled artisan and 
are within the scope of the invention. 

10 

Breast, Ovarian, Breast Cancer and Ovarian Cancer Associated Polynucleotides and 
Polypeptides of the Invention 

It has been discovered herein that the polynucleotides described in Table 1 are 
expressed at significantly enhanced levels in human breast, ovarian, breast cancer and/or 

15 ovarian cancer tissues. Accordingly, such polynucleotides, polypeptides encoded by such 
polynucleotides, and antibodies specific for such polypeptides find use in the prediction, 
diagnosis, prevention and treatment of disorders related to the female reproductive system, 
specifically disorders of the breast and/or ovary, including breast cancer and/or ovarian 
cancer as more ftilly described below. 

20 Table 1 summarizes some of the polynucleotides encompassed by the invention 

(including contig sequences (SEQ ID NO:X) and the related cDNA clones) and further 
summarizes certain characteristics of these breast, ovarian, breast cancer and/or ovarian 
cancer associated polynucleotides and the polypeptides encoded thereby. 
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The first column of Table I shows the "SEQ ID NO:" for each of the 418 
breast/ovarian cancer antigen polynucleotide sequences of the invention. 

The second column in Table 1, provides a unique "Sequence/Contig ID'* identification 
for each breast, ovarian, breast cancer and/or ovarian cancer associated sequence. The third 
5 column in Table 1 , "Gene Name/' provides a putative identification of the gene based on the 
sequence similarity of its translation product to an amino acid sequence found in a publicly 
accessible gene database, such as GenBank (NCBI). The great majority of the cDNA 
sequences reported in Table I are unrelated to any sequences previously described in the 
literature. The fourth column, in Table 1, "Overlap," provides the database accession no. for 

10 the database sequence having similarity. The fifth and sixth columns in Table 1 provide the 
location (nucleotide position nbs. within the contig), "Start" and "End", in the polynucleotide 
sequence "SEQ ID NO:X" that delineate the preferred ORF shown in the sequence listing as 
SEQ ID NO:Y. In one embodiment, the invention provides a protein comprising, or 
alternatively consisting of, a polypeptide encoded by the portion of SEQ ID NO:X delineated 

15 by the nucleotide position nos. "Start'' and "End". Also provided are polynucleotides 
encoding such proteins and the complementary strand thereto. The seventh and eighth 
columns provide the "% Identity" (percent identity) and "% Similarity" (percent similarity) 
observed between the aligned sequence segments of the translation product of SEQ ID NO:X 
and the database sequence. 

20 The ninth column of Table I provides a unique "Clone ID" for a clone related to each 

contig sequence. This clone ID references the cDNA clone which contains at least the 5' most 
sequence of the assembled contig and at least a portion of SEQ ID NO:X was determined by 
directly sequencing the referenced clone. The reference clone may have more sequence than 
described in the sequence listing or the clone may have less. In the vast majority of cases, 

25 however, the clone is believed to encode a full-length polypeptide. In the case where a clone 
is not full-length, a full-length cDNA can be obtained by methods described elsewhere 
herein. 

Table 3 indicates public ESTs, of which at least one, two, three, four, five, ten, or 
more of any one or more of these public ESTs are optionally excluded from the invention. 
30 SEQ ID NO:X (where X may be any of the polynucleotide sequences disclosed in the 

sequence listing as SEQ ID NO: 1 through SEQ ID NO:418) and the translated SEQ ID NO:Y 
(where Y may be any of the polypeptide sequences disclosed in the sequence listing as SEQ 
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ID N0:418 through SEQ ID NO:836) are sufficiently accurate and otherwise suitable for a 
variety of uses well known in the art and decribed further below. For instance, SEQ ID 
NO:X has uses including, but not limited to, in designing nucleic acid hybridization probes 
that will detect nucleic acid sequences contained in SEQ ID NO:X or the related cDNA clone 

5 contained in a library deposited with the ATCC. These probes will also hybridize to nucleic 
acid molecules in biological samples, thereby enabling immediate applications in 
chromosome mapping, linkage analysis, tissue identification and/or typing, and a variety of 
forensic and diagnostic methods of the invention. Similarly, polypeptides identified from 
SEQ ID NO:Y have uses that include, but are not limited to, generating antibodies which 

10 bind specifically to the breast/ovarian cancer antigen polypeptides, or fragments thereof, 
and/or to the breast/ovarian cancer antigen polypeptides encoded by the cDNA clones 
identified in Table 1. 

Nevertheless, DNA sequences generated by sequencing reactions can contain 
sequencing errors. The errors exist as misidentified nucleotides, or as insertions or deletions 

15 of nucleotides in the generated DNA sequence. The erroneously inserted or deleted 
nucleotides cause frame shifts in the reading frames of the predicted amino acid sequence. In 
these cases, the predicted amino acid sequence diverges from the actual amino acid sequence, 
even though the generated DNA sequence may be greater than 99.9% identical to the actual 
DNA sequence (for example, one base insertion or deletion in an open reading frame of over 

20 1000 bases). 

Accordingly, for those applications requiring precision in the nucleotide sequence or 
the amino acid sequence, the present invention provides not only the generated nucleotide 
sequence identified as SEQ ID NO:X, the predicted translated amino acid sequence identified 
as SEQ ID NO:Y, but also a sample of plasmid DNA containing the related cDNA clone 

25 (deposited with the ATCC, as set forth in Table 1). The nucleotide sequence of each 
deposited clone can readily be determined by sequencing the deposited clone in accordance 
with known methods. Further, techniques known in the art can be used to verify the 
nucleotide sequences of SEQ ID NO:X. 

The predicted amino acid sequence can then be verified from such deposits. 

30 Moreover, the amino acid sequence of the protein encoded by a particular clone can also be 
directly determined by peptide sequencing or by expressing the protein in a suitable host cell 
containing the deposited human cDNA, collecting the protein, and determining its sequence. 
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The present invention also relates to vectors or plasmids which include such DNA 
sequences, as well as the use of the DNA sequences. The material deposited with the ATCC 
on: 

5 Table 2 



ATCC Deposits 


Deposit Date 


ATCC Designation Number 


LPOl, LP02, LP03, LP04, 
LP05, LP06, LP07, LP08, 
LP09, LP 10, LP 11, 


May-20-97 


209059, 209060, 209061, 209062, 
209063, 209064, 209065, 209066, 
209067, 209068, 209069 


LP12 


Jan- 12-98 


209579 


LP13 


Jan- 12-98 


209578 


LP 14 


Jul- 16-98 


203067 


LP15 


Jul- 16-98 


203068 


LP16 


Feb- 1-99 


203609 


LP17 


Feb- 1-99 


203610 


LP20 


Nov- 17-98 


203485 


LP21 


Jun- 18-99 


PTA-252 


LP22 


Jun- 18-99 


PTA-253 


LP23 


Dec-22-99 


PTA-1081 



each is a mixture of cDNA clones derived from a variety of human tissue and cloned in either 
a plasmid vector or a phage vector, as shown in Table 5. These deposits are referred to as 

10 "the deposits" herein. The tissues from which the clones were derived are listed in Table 5, 
and the vector in which the cDNA is contained is also indicated in Table 5. The deposited 
material includes the cDNA clones which were partially sequenced and are related to the 
SEQ ID NO:X described in Table 1 (column 9). Thus, a clone which is isolatable from the 
ATCC Deposits by use of a sequence listed as SEQ ID NO:X may include the entire coding 

15 region of a human gene or in other cases such clone may include a substantial portion of the 
coding region of a human gene. Although the sequence listing lists only a portion of the 
DNA sequence in a clone included in the ATCC Deposits, it is well within the ability of one 
skilled in the art to complete the sequence of the DNA included in a clone isolatable from the 
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ATCC Deposits by use of a sequence (or portion thereof) listed in Table 1 by procedures 
hereinafter further described, and others apparent to those skilled in the art. 

Also provided in Table 5 is the name of the vector which contains the cDNA clone. 
Each vector is routinely used in the an. The following additional information is provided for 
5 convenience. 

Vectors Lambda Zap (U.S. Patent Nos. 5,128,256 and 5,286,636), Uni-Zap XR (U.S. 
Patent Nos. 5,128, 256 and 5,286,636), Zap Express (U.S. Patent Nos. 5,128,256 and 
5,286,636), pBluescript (pBS) (Short, J. M. et al., Nucleic Acids Res. y5;7583-7600 (1988); 
Alting-Mees, M. A. and Short, J. M., Nucleic Acids Res. /7;9494 (1989)) and pBK (Alting- 

10 Mees, M. A. et al., Strategies 5;58-61 (1992)) are commercially available from Stratagene 
Cloning Systems, Inc., 11011 N. Torrey Pines Road, La Jolla, CA, 92037. pBS contains an 
ampicillin resistance gene and pBK contains a neomycin resistance gene. Phagemid pBS 
may be excised from the Lambda Zap and Uni-Zap XR vectors, and phagemid pBK may be 
excised from the Zap Express vector. Both phagemids may be transformed into E. coli strain 

15 XL-1 Blue, also available from Stratagene. 

Vectors pSportl, pCMVSport 1.0, pCMVSport 2.0 and pCMVSport 3.0, were 
obtained from Life Technologies, Inc., P. O. Box 6009, Gaithersburg, MD 20897. All Sport 
vectors contain an ampicillin resistance gene and may be transformed into E. coli strain 
DHIOB, also available from Life Technologies. See, for instance, Gruber, C. E., et al., Focus 

20 75;59 (1993). Vector lafmid BA (Bento Soares, Columbia University, New York, NY) 
contains an ampicillin resistance gene and can be transformed into £. coli strain XL-1 Blue. 
Vector pCR®2.1, which is available from Invitrogen, 1600 Faraday Avenue, Carlsbad, CA 
92008, contains an ampicillin resistance gene and may be transformed into E, coli strain 
DHIOB, available from Life Technologies. See, for instance, Clark, J. M., Nuc. Acids Res, 

25 ;6.-9677-9686 (1988) and Mead, D. et ai, Bio/Technology 9: (1991). 

The present invention also relates to the genes corresponding to SEQ ID NO:X, SEQ 
ID NO:Y, and/or the cDNA contained in a deposited cDNA clone. The corresponding gene 
can be isolated in accordance with known methods using the sequence information disclosed 
herein. Such methods include, but are not limited to, preparing probes or primers from the 

30 disclosed sequence and identifying or amplifying the corresponding gene from appropriate 
sources of genomic material. 
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Also provided in the present invention are allelic variants, orthologs, and/or species 
homologs. Procedures known in the art can be used to obtain full-length genes, allelic 
variants, splice variants, full-length coding portions, orthologs, and/or species homologs of 
genes corresponding to SEQ ID NO:X, SEQ ID NO:Y, and/or the cDNA contained in the 
5 related cDNA clone in the deposit, using information from the sequences disclosed herein or 
the clones deposited with the ATCC. For example, allelic variants and/or species homologs 
may be isolated and identified by making suitable probes or primers from the sequences 
provided herein and screening a suitable nucleic acid source for allelic variants and/or the 
desired homologue. 

10 The present invention provides a polynucleotide comprising, or alternatively 

consisting of, the nucleic acid sequence of SEQ ID NO:X, and/or the related cDNA clone 
(See, e.g., columns 1 and 9 of Table 1). The present invention also provides a polypeptide 
comprising, or alternatively, consisting of, the polypeptide sequence of SEQ ID NO:Y, a 
polypeptide encoded by SEQ ID NO:X, and/or a polypeptide encoded by the cDNA in the 

15 related cDNA clone contained in a deposited library. Polynucleotides encoding a polypeptide 
comprising, or alternatively consisting of, the polypeptide sequence of SEQ ID NO:Y, a 
polypeptide encoded by SEQ ID NO:X, and/or a polypeptide encoded by the the dDNA in the 
related cDNA clone contained in a deposited library, are also encompassed by the invention. 
The present invention further encompasses a polynucleotide comprising, or alternatively 

20 consisting of, the complement of the nucleic acid sequence of SEQ ID NO:X, and/or the 
complement of the coding strand of the related cDNA clone contained in a deposited library. 

Many polynucleotide sequences, such as EST sequences, are publicly available and 
accessible through sequence databases and may have been publicly available prior to 
conception of the present invention. Preferably, such related polynucleotides are specifically 

25 excluded from the scope of the present invention. To list every related sequence would 
unduly burden the disclosure of this application. Accordingly, for each "Contig Id" listed in 
the first column of Table 3, preferably excluded are one or more polynucleotides comprising 
a nucleotide sequence described in the second column of Table 3 by the general formula of a- 
b, each of which are uniquely defined for the SEQ ID NO:X corresponding to that Contig Id 

30 in Table 1. Additionally, specific embodiments are directed to polynucleotide sequences 
excluding at least one, two, three, four, five, ten, or more of the specific polynucleotide 
sequences referenced by the Genbank Accession No. for each Contig Id which may be 
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included in column 3 of Table 3. In no way is this listing meant to encompass all of the 
sequences which may be excluded by the general formula, it is just a representative example. 
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Table 3 



Sequence/ 
Contig ID 


General formula 


Cenbank Accession No. 




riciciuDiy caCIUqcq tiuiti iiic prcbcni 
invention are one or more 
polynucleotides comprising a nucleotide 
sequence described by the general 
formula of a-b, where a is any integer 
between 1 to 1899 ofSEQ IDNO:i,bis 
an integer of 15 to 1913, where both a 
and b correspond to the positions of 
nucleotide residues shown in SEQ ID 
NO:K and where b is greater than or 
equal to a + 14. 


lOoJoJ. IOoOOj, 1 oOj IJ. I oOj In, K I Zj jO, 

R31374, R32873, R37282, R84617, R85369, 
R9917h H48474. N23871. N58201, N74557, 
W90334. AA03 1318, AA03 1 427, AA 13023 1 , 
AA256587 




Preferably excluded from the present 
invention are one or more 
polynucleotides comprising a nucleotide 
sequence described by the general 
foiTHula of a-b, where a is any integer 
between 1 to 141 1 of SEQ ID N0:2, b is 
an integer of 15 to 1425, where both a 
and b correspond to the positions of 
nucleotide residues shown in SEQ ID 
N0:2, and where b is greater than or 
equal to a + 14. 


D'^f*</n xyA'^fii^ D^o^TA Don^j/io d/cicai 

H08662, H77556, H97365, N24198, N33 135, 
N74546. N93573, W0294L W52194, 
AA004624, AA00472 1 , AA0467 1 0, AA235395, 
AA235479 


DUO / / / 


Preferably excluded from the present 
invention are one or more 
polynucleotides comprising a nucleotide 
sequence described by the general 
formula of a-b, where a is any integer 
between I to 340 of SEQ ID N0:3, b is 
an integer of 15 to 354, where both a 
and b coiTespond to the positions of 
nucleotide residues shown in SEQ ID 
N0:3, and where b is greater than or 
equal to a + 14. 




3U0O/0 


Preferably excluded from the present 
invention are one or more 
polynucleotides comprising a nucleotide 
sequence described by the general 

Frirmiila nf n-K \!i/hpr/» q ic arw/ int^fypr 
lUilllUIa \JL a~U, WUCIC a lb ally llLlCgCl 

between 1 to 500 of SEQ ID N0:4, b is 
an integer of 15 to 5 14, where both a 
and b correspond to the positions of 
nucleotide residues shown in SEQ ID 
SO A, and where b is greater than or 
equal to a + 14. 


/ 1 /D, AAlzOJZ, AA12 7094 


508968 


r*referably excluded from the present 
mvention are one or more 
polynucleotides comprising a nucleotide 
sequence described by the general 


T7 1 94 1 , T94428, T945 1 4. H023 1 3 , N269 1 3, 
N47870, N66244, N92418. W31301, W42459, 
W42564, AA08403I, AA 126786, AA258050, 
AA459772 
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formula of a-b, where a is any integer 
between 1 to 2021 of SEQ ID N0:5, b is 
an integer of 15 to 2035, where both a 
and b correspond to the positions of 
nucleotide residues shown in SEQ ID 
N0:5, and where b is greater than or 
equal to a + 14. 




509029 


Preferably excluded from the present 
invention are one or more 
polynucleotides comprising a nucleotide 
sequence described by the general 
formula of a-b, where a is any integer 
between 1 to 1 182 of SEQ ID N0:6, b is 
an integer of 1 5 to 11 96, where both a 
and b correspond to the positions of 
nucleotide residues shown in SEQ ID 
NO: 6, and where b is greater than or 
equal to a + 14. 


Rll213,R11271, H14072, H14071, H5153I, 
H66637, H66636, W23707, W35307, 
AA025586, AA025710, AA058796, AAl 13917 


519726 


Preferably excluded from the present 
invention are one or more 
polynucleotides comprising a nucleotide 
sequence described by the general 
formula of a-b, where a is any integer 
between 1 to 610 of SEQ ID N0:7, b is 
an integer of 1 5 to 624, where both a 
and b correspond to the positions of 
nucleotide residues shown in SEQ ID 
N0:7, and where b is greater than or 
equal to a + 14. 


AA236015, AA236085, AA256106 


522632 


Preferably excluded from the present 
invention are one or more 
polynucleotides comprising a nucleotide 
sequence described by the general 
formula of a-b, where a is any integer 
between 1 to 287 of SEQ ID NO:8, b is 
an integer of 1 5 to 30 1 , where both a 
and b correspond to the positions of 
nucleotide residues shown in SEQ ID 
N0:8, and where b is greater than or 
equal to a + 14. 




524655 


Preferably excluded from the present 
invention are one or more 
polynucleotides comprising a nucleotide 
sequence described by the general 
formula of a-b, where a is any integer 
between 1 to 672 of SEQ ID N0:9, b is 
an integer of 15 to 686, where both a 
and b correspond to the positions of 
nucleotide residues shown in SEQ ID 
M0:9, and where b is greater than or 
equal to a+ 14. 


T66495, R15869, R3 9696, HI 6266, H20784, 
H22599, N68 1 50, W5800 1 , W57856 


525847 


Preferably excluded from the present 
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invention are one or more 
polynucleotides comprising a nucleotide 
sequence described by the general 
formula of a-b, where a is any integer 
between 1 to 383 of SEQ ID NO: 10, b is 
an integer of 15 to 397, where both a 
and b coiTespond to the positions of 
nucleotide residues shown in SEQ ID 
NO: 1 0, and where b is greater than or 
equal to a + 14. 




530306 


Preferably excluded from the present 
invention are one or more 
polynucleotides comprising a nucleotide 
sequence described by the general 
formula of a-b, where a is any integer 
between 1 to 549 of SEQ ID NO:! 1, b is 
an integer of 1 5 to 563, where both a 
and b correspond to the positions of 
nucleotide residues shown in SEQ ID 
NO: 1 1, and where b is greater than or 
equal to a + 14. 




532818 


Preferably excluded from the present 
invention are one or more 
polynucleotides comprising a nucleotide 
sequence described by the general 
formula of a-b, where a is any integer 
between I to 429 of SEQ ID NO: 12, b is 
an integer of 1 5 to 443, where both a 
and b correspond to the positions of 
nucleotide residues shown in SEQ ID 
NO: 12, and where b is greater than or 
equal to a + 14. 


AA 188990, AM 9 1040 


533385 


Preferably excluded from the present 
invention are one or more 
polynucleotides comprising a nucleotide 
sequence described by the general 
formula of a-b, where a is any integer 
i^etween I to 2424 of SEQ ID NO: 13, b 
is an integer of 15 to 2438, where both a 
and b correspond to the positions of 
nucleotide residues shown in SEQ ID 
NO: 1 3, and where b is greater than or 
equal to a + 14. 




533532 


Preferably excluded from the present 
invention are one or more 
polynucleotides comprising a nucleotide 
sequence described by the general 
•ormula of a-b, where a is any integer 
between 1 to 2333 of SEQ ID NO: 14, b 
is an integer of 1 5 to 2347, where both a 
and b correspond to the positions of 
nucleotide residues shown in SEO ID 


T94240, T77619, R13236, R17515, R33142, 
R33294, R39249, R40318, R42609, R42609, 
R403 1 8, R75952, H03594, H 1 2337, H 1239 1 , 
H70913, H70916, H70996, H71001, H87858, 
H70913, N21374, N31326, N35068, N35435, 
N43807, N45045, W46431, W46486, W519I7, 
AAO 19546, AAO 18858, AA056764, AA056767, 
AA058441, AA058445, AA083228, AA083269, 
AAl 15939, AA 122236, AA 1 47307, AA 159802. 
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NO: 14, and where b is greater than or 
equal to a + 14. 


A A 1 650 1 5 . A A 1 65642. A A 1 8 1 869, AA 1 86834, 
AA252269. AA255892. AA463239, AA463240 


534852 


Preferably excluded from the present 
invention are one or more 
polynucleotides comprising a nucleotide 
sequence described by the general 
formula of a-b, where a is any integer 
between 1 to 1992 of SEQ ID NOils, b 
is an integer of 15 to 2006, where both a 
and b coiTCspond to the positions of 
nucleotide residues shown in SEQ ID 
NO: 1 5. and where b is greater than or 
equal to a 14. 


T55469, T63434. R10603, R10604, H50597, 
H92640. H94634, W39162, W93243, W94634. 
W947 1 9, N90240. AA053667, AA 1673 12, 
AA253414. AA253389 


537910 


Preferably excluded from the present 
invention are one or more 
polynucleotides comprising a nucleotide 
sequence described by the general 
formula of a-b, where a is any integer 
between 1 to 972 of SEQ ID NO: 16, b is 
an integer of 15 to 986, where both a 
and b correspond to the positions of 
nucleotide residues shown in SEQ ID 
NO: 16, and where b is greater than or 
equal to a + 14. 


R23785 


538460 


Preferably excluded from the present 
invention are one or more 
polynucleotides comprising a nucleotide 
sequence described by the general 
formula of a-b, where a is any integer 
between 1 to 1575 of SEQ ID NO: 17, b 
is an integer of 15 to 1589. where both a 
and b correspond to the positions of 
nucleotide residues shown in SEQ ID 
MO: 17, and where b is greater than or 
equal to a + 14. 


R 13084, R40514, R40514, R55303, R55402, 
W67446 


539577 


Preferably excluded from the present 
invention are one or more 
polynucleotides comprising a nucleotide 
sequence described by the general 
formula of a-b, where a is any integer 
between 1 to 832 of SEQ ID NO: 18, b is 
an integer of 1 5 to 846, where both a 
and b correspond to the positions of 
nucleotide residues shown in SEQ ID 
^O: 1 8, and where b is greater than or 
squal to a + 14. 


T49208, N35488, AA088419, AA127572, 
AA 1 27649, AA 15631 6, AA 1 69250 


548379 

I 

1 
1 


Preferably excluded from the present 
nvention are one or more 
polynucleotides comprising a nucleotide 
sequence described by the general 
brmula of a-b, where a is any integer 
between 1 to 2178 of SEQ ID NO: 19, b 


R23778, H70824 



wo 00/55173 



PCT/USOO/05881 



80 





is an integer of 1 5 to 2 1 92, where both a 
and b correspond to the positions of 
nucleotide residues shown in SEQ ID 
NO: 19. and where b is greater than or 
equal to a + 14. 




548489 


Preferably excluded from the present 
invention are one or more 
polynucleotides comprising a nucleotide 
sequence described by the general 
formula of a-b, where a is any integer 
between 1 to 997 of SEQ ID NO:20, b is 
an integer of 15 to 1011, where both a 
and b correspond to the positions of 
nucleotide residues shown in SEQ ID 
NO:20, and where b is greater than or 
equal to a 14. 


T4986L T49862, T56225, T56367, T72I70, 
T72948, T92867, T74728, R08625, R08719, 
R 1 7408, R24674, R25174, R25378, R25997, 
R26800, R28401, R3I330, R31589, R42642, 
R45259, R42642, R45259, R62552, R62553, 
R66386, R67726, R6878K R68878, H25120, 
H2512K H41 1 15, H41 190, H41 191, R84227, 
R87629, H53386, H64419, H64476, H72640, 
H7264 1 . H644 1 9, H9930 1 , N2234 1 , N25846, 
N2937a N29843, N4791 8, N57261, N59763, 
N63813. N94171, W23786, W45524, W721 11, 
W77797, AA010718, AAOl 1 164, AA033553, 
AA033554, AA062727, AA062741, AA062784, 
AA06981 1, AA075470, AA075471, AA081844, 
AA083492, AA084442, AA 100358, AA 126263, 
AA126354, AA136544, AA136648, AA146862, 
AA 146863, AA 179509, AA 1 79540, AA 179775, 
AAI80492, AA181719, AA188903, AA189140, 
AA226959, AA227247 


548595 


Preferably excluded from the present 
invention are one or more 
polynucleotides comprising a nucleotide 
sequence described by the general 
formula of a-b, where a is any integer 
between 1 to 2005 of SEQ ID N0:21, b 
is an integer of 15 to 2019, where both a 
and b correspond to the positions of 
nucleotide residues shown in SEQ ID 
N0:21, and where b is greater than or 
equal to a + 14. 


T6I537, T69836, R10679, R42501, R46798, 
R4250L R46798, H05289, H05822, HI 2239, 
HI 68 16, H40312, R86905, R86985, N21432, 
N73268, W73102, N91565, AA033533, 
AA053026, AA121547, AA127684, AA190356, 
AA195451, AA226965, AA232522, AA258142 


549337 


Preferably excluded from the present 
invention are one or more 
polynucleotides comprising a nucleotide 
sequence described by the general 
formula of a-b, where a is any integer 
between 1 to 2008 of SEQ ID NO:22, b 
is an integer of 15 to 2022, where both a 
and b correspond to the positions of 
nucleotide residues shown in SEQ ID 
NO:22, and where b is greater than or 
equal to a + 14. 




549777 


Preferably excluded from the present 
invention are one or more 
polynucleotides comprising a nucleotide 
sequence described by the general 
formula of a-b, where a is any integer 
between 1 to 1 1 1 2 of SEQ ID NO:23, b 


T81557, R27931, R38730, R39493, R39494, 
R66845, R67942, R69099, R69214, R69613, 
R69703, R69740, R72430, R72478, R73090, 
R73091,R73872, R73955, R82662, R82715, 
H01096, H01097, H721 13, N76I39, W58493, 
W72884, W74409, W94644, W92532, 
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is an integer of 15 to 1 126. where both a 
and b correspond to the positions of 
nucleotide residues shown in SEQ ID 
NO:23, and where b is greater than or 
equal to a + 14. 


AA0229 1 6, A A0229 1 7. A A03966 1 . A A039660, 
AA043439, AA054965, AA 1 52376, AA 148360, 
AA181225, AA188435 


553091 


Preferably excluded from the present 
invention are one or more 
polynucleotides comprising a nucleotide 
sequence described by the general 
formula of a-b, where a is any integer 
between 1 to 2584 of SEQ ID NO:24. b 
is an integer of 1 5 to 2598, where both a 
and b correspond to the positions of 
nucleotide residues shown in SEQ ID 
NO:24, and where b is greater than or 
equal to a+ 14. 




553827 


Preferably excluded from the present 
invention are one or more 
polynucleotides comprising a nucleotide 
sequence described by the general 
formula of a-b, where a is any integer 
between 1 to 397 of SEQ Id 'nO:25, b is 
an integer of 1 5 to 4 11 , where both a 
and b correspond to the positions of 
nucleotide residues shown in SEQ ID 
NO:25, and where b is greater than or 
equal to a + 14. 




556350 


Preferably excluded from the present 
invention are one or more 
polynucleotides comprising a nucleotide 
sequence described by the general 
formula of a-b, where a is any integer 
between I to 643 of SEQ ID NO:26, b is 
an integer of 15 to 657, where both a 
and b correspond to the positions of 
nucleotide residues shown in SEQ ID 
NO:26, and where b is greater than or 
equal to a + 14. 


T70920, R01856, R37402, H21077, H21531, 
R94734, N29364, N32255, N80553, W07675, 
W58340, W58661, W67208, W67352, 
AA039658, AA039659, AA046392, AA055650, 
AA058365, AA070442, AA088882, AA 102056, 
AAI34I44, AA165363. AA171617, AA173761. 
AA 173771, A A252260, AA464575, AA464679 


556351 


Preferably excluded from the present 
invention are one or more 
polynucleotides comprising a nucleotide 
sequence described by the general 
formula of a-b, where a is any integer 
between I to 1889 of SEQ ID NO:27, b 
is an integer of 15 to 1903, where both a 
and b correspond to the positions of 
nucleotide residues shown in SEQ ID 
MO: 27, and where b is greater than or 
equal to a + 14. 


T70981. R0i855, R 13494, H21076, H24431, 
H24460, R94817, N47912, AA040086, 
AA040133, AA055706, AA056162, AA058484, 
AAI02055, AA102304, AA130304, AA173608, 
AA 195879 


557007 


Preferably excluded from the present 
invention are one or more 
polynucleotides comprising a nucleotide 


H13846, HI3894, H16354, H20742, H20743. 
R97935, R97936, H87445, N29633, AA015991, 
AA045671, AA045670, AA099I54, AA099252 



wo 00/55173 



PCTAJSOO/05881 



82 





sequence described by the general 
formula of a-b, where a is any integer 
between 1 to 1319ofSEQ ID NO:28, b 
is an integer of 15 to 1333. where both a 
and b correspond to the positions of 
nucleotide residues shown in SEQ ID 
NO:28. and where b is greater than or 
equal to a + 14. 




558140 


Preferably excluded from the present 
invention are one or more 
polynucleotides comprising a nucleotide 
sequence described by the general 
formula of a-b, where a is any integer 
between 1 tol 3 1 3 of SEQ ID NO:29, b 
is an integer of 1 5 to 1 327, where both a 
and b correspond to the positions of 
nucleotide residues shown in SEQ ID 
NO:29, and where b is greater than or 
equal to a + 14. 


T6299K W58535, W58500, AA053629, 
AA083878, AAl 12892, AA157250, AA157345, 
AA 194089^ AA253436, AA250750 


558456 


Preferably excluded from the present 
invention are one or more 
polynucleotides comprising a nucleotide 
sequence described by the general 
formula of a-b, where a is any integer 
between 1 to 695 of SEQ ID NO:30. b is 
an integer of 1 5 to 709, where both a 
and b correspond to the positions of 
nucleotide residues shown in SEQ ID 
NO: 30, and where b is greater than or 
equal to a + 14. 




558708 


Preferably excluded from the present 
invention are one or more 
polynucleotides comprising a nucleotide 
sequence described by the general 
formula of a-b, where a is any integer 
between I to 1094 of SEQ IDN0:31,b 
is an integer of 1 5 to 1 108, where both a 
and b correspond to the positions of 
nucleotide residues shown in SEQ ID 
NO: 3 1 , and where b is greater than or 
equal to a + 14. 


R38385, W24640, W48793, W49619 


574789 


Preferably excluded from the present 
invention are one or more 
polynucleotides comprising a nucleotide 
sequence described by the general 
formula of a-b, where a is any integer 
between 1 to 5 12 of SEQ ID NO:32, b is 
an integer of 15 to 526, where both a 
and b correspond to the positions of 
nucleotide residues shown in SEQ ID 
NO:32, and where b is greater than or 
equal to a + 14. 


^M49156 
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578203 


Preferably excluded from the present 
invention are one or more 
polynucleotides comprising a nucleotide 
sequence described by the general 
formula of a-b, where a is any integer 
between 1 to 541 of SEQ ID N0:31 b is 
an integer of 1 5 to 555, where both a 
and b correspond to the positions of 
nucleotide residues shown in SEQ ID 
NO:33, and where b is greater than or 
equal to a + 14. 


AA 149853 


585385 


Preferably excluded froni the present 
invention are one or more 
polynucleotides comprising a nucleotide 
sequence described by the general 
formula of a-b, where a is any integer 
between 1 to 333 of SEQ ID NO:34, b is 
an integer of 1 5 to 347, where both a 
and b correspond to the positions of 
nucleotide residues shown in SEQ ID 
NO:34, and where b is greater than or 
equal to a + 14. 




588869 


Preferably excluded from the present 
invention are one or more 
polynucleotides comprising a nucleotide 
sequence described by the general 
formula of a-b, where a is any integer 
between I to 736 of SEQ ID NO:35, b is 
an integer of 15 to 750, where both a 
and b correspond to the positions of 
nucleotide residues shown in SEQ ID 
f^JO:35, and where b is greater than or 
equal to a + 14. 




597076 


Preferably excluded from the present 
invention are one or more 
polynucleotides comprising a nucleotide 
sequence described by the general 
formula of a-b, where a is any integer 
between 1 to 1277 of SEQ ID NO:36, b 
is an integer of 1 5 to 1291, where both a 
and b correspond to the positions of 
nucleotide residues shown in SEQ ID 
"^0:36, and where b is greater than or 
equal to a+ 14. 




598656 


Preferably excluded from the present 
nvention are one or more 
jolynucleotides comprising a nucleotide 
sequence described by the general 
formula of a-b, where a is any integer 
jetween I to 1521 of SEQ IDNO:37, b 
s an integer of 15 to 1535, where both a 
ind b correspond to the positions of 
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nucleotide residues shown in SEQ ID 
NO:37, and where b is greater than or 
equal to a + 14. 




611880 


Preferably excluded from the present 
invention are one or more 
polynucleotides comprising a nucleotide 
sequence described by the general 
formula of a-b, where a is any integer 
between 1 to 281 of SEQ ID NO:38, b is 
an integer of 15 to 295, where both a 
and b correspond to the positions of 
nucleotide residues shown in SEQ ID 
NO:38, and where b is greater than or 
equal to a + 14. 




614329 


Preferably excluded from the present 
invention are one or more 
polynucleotides comprising a nucleotide 
sequence described by the general 
formula of a-b, where a is any integer 
between 1 to 1286 of SEQ ID NO: 39, b 
is an integer of 1 5 to i 300, where both a 
and b correspond to the positions of 
nucleotide residues shown in SEQ ID 
NO:39, and where b is greater than or 
equal to a + 14. 


T49777, T51334, T49778, T66835, T66836, 
178401, R33579, R33684, R3436I, R34476, 
R72556, R75702, H0I59K H02719, H13232, 
H 1 3599, H 1 3942, H 1 3943, H63376, H80729, 
H80730, H89353, H89539, H99395, N26995, 
N32930, N401 16, N42081, N50408, N50460, 
N63978, N67308, N92847, W464I3, 
AA126994, AA128I41, AA146958, AA146957, 
AA425764 


616066 


Preferably excluded from the present 
invention are one or more 
polynucleotides comprising a nucleotide 
sequence described by the general 
formula of a-b, where a is any integer 
between I to 201 of SEQ ID N0:4b, b is 
an integer of 15 to 215, where both a 
and b correspond to the positions of 
nucleotide residues shown in SEQ ID 
NO:40, and where b is greater than or 
equal to a + 14. 




620956 


Preferably excluded from the present 
invention are one or more 
polynucleotides comprising a nucleotide 
sequence described by the general 
formula of a-b, where a is any integer 
between 1 to 460 of SEQ ID N0:4 1 , b is 
an integer of 1 5 to 474, where both a 
and b correspond to the positions of 
nucleotide residues shown in SEQ ID 
M0:4l, and where b is greater than or 
equal to a+ 14. 




621889 


Preferably excluded from the present 
invention are one or more 
polynucleotides comprising a nucleotide 
sequence described by the general 
formula of a-b, where a is any integer 
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between 1 to 4 1 1 of SEQ ID NO:42, b is 
an integer of 1 5 to 425. where both a 
and b correspond to the positions of 
iiucieoiide residues shown in SEQ ID 
NO:42, and where b is greater than or 
equal to a + 14. 




624017 


Preferably excluded from the present 

polynucleotides comprising a nucleotide 
sequence described by the general 
formula of a-b, where a is any integer 
between 1 to 11 73 of SEQ 1DN0:43, b 
is an integer of 1 5 to 1 1 87. where both a 
and b correspond to the positions of 
nucleotide residues shown in SEQ ID 
NO:43, and where b is greater than or 
equal to a + 14. 


T61010. AA071044. AA088260, AA098798, 
AA107017 AA100707 AAI 11883 AAI 13305 
AA121495, AA133235, AA131438, AA13201 1, 
AA132866, AA143457, AA 146581, AAI 46805, 
AA146928, AA155613, AA155609, AA158090, 
AA 1 58263, AA 1 64694, AA 1 6559 1 , AA 1 76429, 
AA226820 


651784 


Preferably excluded from the present 
invention are one or more 
polynucleotides comprising a nucleotide 
sequence described by the general 
formula of a-b, where a is any integer 
between I to 501 of SEQ ID NO:44, b is 
an integer of 1 5 to 5 1 5, where both a 
and b correspond to the positions of 
nucleotide residues shown in SEQ ID 
NO:44, and where b is greater than or 
equal to a + 14. 


VV32583, W68240, W94174, AA25i670, 
AA25201 1. AA252266, AA425209 


651826 


Preferably excluded from the present 
invention are one or more 
polynucleotides comprising a nucleotide 
sequence described by the general 
formula of a-b.. where a is any integer 
between 1 to 1485 of SEQ ID NO:45, b 
is an integer of 1 5 to 1499, where both a 
and b correspond to the positions of 
nucleotide residues shown in SEQ ID 
NO:45, and where b is greater than or 
equal to a + 14. 


T47384, T47385, T60137, T60194, T71947, 
T95050, T95146, R25340, R25476, R261 17, 
R26301, R27566, R27664, R28180, R33393, 
R35872, R35873, R36483, R48329, R48438, 
R62139, R62244, R66007, R66008, R66764, 
R70718, R70719, R73674, R73761, R74132, 
R76569, R76643, R77265, R77312, R78827, 
R79686, R79687, R813 16, R81751, H00804, 
H00891, H01415. H0I416, H02522, H03673, 
H13925, H13926, H24743, H26369, H26727, 
H26728, H27132, H27480, H27663, H28192, 
H28235, H41929, H41977, H42604, H43209, 
H43258, H45278, H45348, H53585, H53906, 
H61785 H6I786 H78337 H78338 H87337 
H87871, H95I83, N27090, N27092, N40499, 
N40502, N99158, W24165, W60193, 
AA039817, AA041344, AA074512, AA079058, 
AA079156, AA079157, AA085829, AA085974, 
AAI 00095, AAI 13304, AAI 42843, AAI 49898, 
AAI 56331. AAI 57820, AAI57895, AA158552, 
AA 1 59 1 77, A A 1 76093, AA 1 79607, AA 1 79608, 
AA 176333, AAI87637, AAI 86769, AAI 88622, 
AAI 88742. AAI 88975 


653282 


Preferably excluded from the present 
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invention are one or more 
polynucleotides comprising a nucleotide 
sequence described by the general 
formula of a-b, where a is any integer 
between 1 to 379 of SEQ ID'nO:46. b is 
an integer of 15 to 393. where both a 
and b correspond to the positions of 
nucleotide residues shown in SEQ ID 
NO:46. and where b is greater than or 
equal to a + 14. 




657122 


Preferably excluded from the present 
invention are one or more 
polynucleotides comprising a nucleotide 
sequence described by the general 
formula of a-b, where a is any integer 
between 1 to 224 of SEQ ID NO:47. b is 
an integer of 1 5 to 238, where both a 
and b correspond to the positions of 
nucleotide residues shown in SEQ ID 
NO:47, and where b is greater than or 
equal to a + 14. 




661442 


Preferably excluded from the present 
invention are one or more 
polynucleotides comprising a nucleotide 
sequence described by the general 
formula of a-b, where a is any integer 
between 1 to 925 of SEQ ID NO:48, b is 
an integer of 1 5 to 939, where both a 
and b correspond to the positions of 
nucleotide residues shown in SEQ ID 
NO:48, and where b is greater than or 
equal to a + 14. 


RI810L AA424721 


664914 


Preferably excluded from the present 
invention are one or more 
polynucleotides comprising a nucleotide 
sequence described by the general 
formula of a-b, where a is any integer 
between 1 to 1757 of SEQ ID NO:49, b 
is an integer of 1 5 to 1 77 1 , where both a 
and b correspond to the positions of 
nucleotide residues shown in SEQ ID 
NO:49, and where b is greater than or 
equal to a + 14. 


186944, T87027, Rl 1421, T81 153, T81380, 
R17243, R17453, R19171, R27826, R27927, 
R35295, R35940, R41854, R42800, R48191, 
R48192, R49457, R51209, R52247, R53413, 
R4I854, R42800, R49457, R55257, R55475, 
R59472, R71390, R81811, R81915, H05137, 
H07974, H30702, H42552, H57923, H58015, 
N71 127, N74282, N75329, N93224, W01557, 
W04382, W04780, W23438, W35253, W38865, 
AA176204, AA194869, AA199875, AA251414 


666654 


Preferably excluded from the present 
invention are one or more 
polynucleotides comprising a nucleotide 
sequence described by the general 
'ormula of a-b, where a is any integer 
between 1 to 383 of SEQ ID NO:50, b is 
an integer of 15 to 397, where both a 
and b correspond to the positions of 
lucleotide residues shown in SEQ ID 
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NO:50, and where b is greater than or 
equal to a + 14. 




667084 


Preferably excluded from the present 
invention are one or more 
polynucleotides comprising a nucleotide 
sequence described by the general 
formula of a-b, where a is any integer 
between 1 to 1621 of SEQ ID N0:5L b 
is an integer of 15 to 1635. where both a 
and b correspond to the positions of 
nucleotide residues shown in SEQ ID 
N0:5 1 , and where b is greater than or 
equal to a + 14. 


R71869, R71870. H22387. H27160, H465.92, 
H61204, H62108.N25274,N94410, AA026642, 
AA069188. AA069189, AA076423. AA076388. 
AA076533, AA076540, AA122346, AA121039, 
A A 1 2 1 092, AA 1 3 3 1 2 1 , AA 1 4347 1 , AA 1 4347o! 
AA143728. AA156363, AA156404, AA158498, 
A A 1 59 1 90, AA 1 5920 1 , AA 1 59286, AA 1 60335^ 
AA159837, AA159573, AA160367, AA159548^ 
AA 160456, AA 1 60697, AA 1 60789, AAl79329i 
A A 1 8 1 540, AA i 82669, A A 1 8688 1 , AA 1 86887^ 
AA188535. AA188540, AA190669, AA190973, 
AA191557. AA235457. AA45851 1, AA418203 


667380 


Preferably excluded from the present 
invention are one or more 
polynucleotides comprising a nucleotide 
sequence described by the general 
formula of a-b. where a is any integer 
between I to 1766 of SEQ ID NO:52, b 
is an integer of 15 to 1780, where both a 
and b correspond to the positions of 
nucleotide residues shown in SEQ ID 
NO:52, and where b is greater than or 
equal to a + 14. 


T87574, RI0276, R10277, T79847, R49790, 
R49832, R59538, R59539, R86940, R87067, 
R87722, R98577, R98578. R99022i R99795! 
H72692. H93036, H93942, H93941, N54059, 
N62326. N647 1 9, N66726, N73888, N741 71 . 
N91734, N93505, W02054, W03949, W04337, 
W21317, AA 192562, AA 192563, AA223984, 
AA224049 


669530 


Preferably excluded from the present 
invention are one or more 
polynucleotides comprising a nucleotide 
sequence described by the general 
formula of a-b, where a is any integer 
between 1 to 476 of SEQ ID NO:53, b is 
an integer of 15 to 490, where both a 
and b correspond to the positions of 
nucleotide residues shown in SEQ ID 
NfO:53, and where b is greater than or 
equal to a+ 14. 


149160, T4916U H41659, R88196, W60799, 
W60930 AA046915 AA046972 AA069701 
AA464334 


671315 


Preferably excluded from the present 
invention are one or more 
polynucleotides comprising a nucleotide 
sequence described by the general 
formula of a-b, where a is any integer 
between 1 to 1930 of SEQ ID NO:54, b 
is an integer of 15 to 1944, where both a 
and b correspond to the positions of 
nucleotide residues shown in SEQ ID 
NO:54, and where b is greater than or 
jqual to a+ 14. 




671993 

1 

J 


Preferably excluded from the present 
nvention are one or more 
3olynucleotides comprising a nucleotide 
sequence described by the general 
brmula of a-b, where a is any integer 
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between 1 to 980 of SEQ ID NO:55, b is 
an integer of 15 to 994. where both a 
and b correspond to the positions of 
nucleotide residues shown in SEQ ID 
NO;55, and where b is greater than or 
equal to a + 14. 




674618 


Preferably excluded from the present 
invention are one or more 
polynucleotides comprising a nucleotide 
sequence described by the general 
formula of a-b, where a is any integer 
between I to 3 14 of SEQ ID NO:56, b is 
an integer of 15 to 328, where both a 
and b correspond to the positions of 
nucleotide residues shown in SEQ ID 
NO:56, and where b is greater than or 
equal to a + 14. 




675027 


Preferably excluded from the present 
invention are one or more 
polynucleotides comprising a nucleotide 
sequence described by the general 
formula of a-b. where a is any integer 
between I to 1475 of SEQ IDNO:57, b 
is an integer of 15 to 1489, where both a 
and b correspond to the positions of 
nucleotide residues shown in SEQ ID 
NO: 57, and where b is greater than or 
equal to a + 14. 


T86474, AA 133454, AA203346 


677202 


Preferably excluded from the present 
invention are one or more 
polynucleotides comprising a nucleotide 
sequence described by the general 
formula of a-b, where a is any integer 
between 1 to 1269 of SEQ ID NO:58, b 
is ail integer of 15 to 1283, where both a 
and b correspond to the positions of 
nucleotide residues shown in SEQ ID 
MO:58, and where b is greater than or 
equal to a + 14. 


T47486, T47487, T47666, T50413, T50493, 
T505 1 9, T5 1 852, T53234, T57067, T60776, 
T40856, T93579, T94432, T94435, T96391, 
R43542, R43542, H21618, H73240, H88867, 
H88868, H89122, H88868, H89122, N21997, 
N22243, N22815, N45720, N48998, N52063, 
N59239, N62103, N66419, N66708, N66782, 
N67139, N67283, N67447, N68047, N70159, 
N71 198, N74676, N76707, N78333, N80016, 
N92971, N93518, W05738, W45694, W48845, 
W80602, AA057801, AA063330, AA064827, 
AA065165, AA065I78, AA065179, AA069552, 
AA070491, AA070949, AA070969, AA071333, 
AA071358, AA074331, AA081280, AAl 1 1928, 
AAl 12051, AA132018, AA13212I, AA147357, 
AA 1 57065, AA 1 57085 , AA 1 57890, AA 1 60054, 
AA18I729, AA182765, AA187698, AA186444, 
AA 1 96 1 68, AA 1 96244, AA224 1 87 


678504 


Preferably excluded from the present 
invention are one or more 
polynucleotides comprising a nucleotide 
sequence described by the general 
formula of a-b, where a is any integer 
between I to 726 of SEQ ID NO:59, b is 
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an integer of 1 5 to 740. where both a 
and b correspond to the positions of 
nucleotide residues shown in SEQ ID 
NO:59, and where b is greater than or 
equal to a + 14. 




678985 


Preferably excluded from the present 
invention are one or more 
polynucleotides comprising a nucleotide 
sequence described by the general 
formula of a-b, where a is any integer 
between 1 to 1277 of SEQ ID NO:60, b 
is an integer of 1 5 to 1 29 1 , where both a 
and b correspond to the positions of 
nucleotide residues shown in SEQ ID 
NO:60, and where b is greater than or 
equal to a + 14. 




682161 


Preferably excluded from the present 
invention are one or more 
polynucleotides comprising a nucleotide 
sequence described by the general 
formula of a-b, where a is any integer 
between I to 957 of SEQ IDN0:6U b is 
an integer of 15 to 97 1, where both a 
and b correspond to the positions of 
nucleotide residues shown in SEQ ID 
N0:61 , and where b is greater than or 
equal to a + 14. 




683476 


Preferably excluded from the present 
invention are one or more 
polynucleotides comprising a nucleotide 
sequence described by the general 
formula of a-b, where a is any integer 
between 1 to 604 of SEQ ID NO:62, b is 
an integer of 1 5 to 6 1 8, where both a 
and b correspond to the positions of 
nucleotide residues shown in SEQ ID 
NO:62, and where b is greater than or 
equal to a+ 14. 




691146 


Preferably excluded from the present 
invention are one or more 
polynucleotides comprising a nucleotide 
sequence described by the general 
formula of a-b, where a is any integer 
between 1 to 1 124 of SEQ ID NO:63, b 
is an integer of 15 to 1 138, where both a 
and b correspond to the positions of 
nucleotide residues shown in SEQ ID 
'^10:63, and where b is greater than or 
squal to a + 14. 


T48865, T48866, T48901, T47562, T48902, 
T54258, T54365, T69783, T70768, R080I2, 
R09058, R09059, T83437, T84082, T99021, 
R09059. R19174, R21551, R22562, R28286, 
R48757, R48758, R49683, R49683, R62406, 
R62407, R70222, R75607, R77000, R78400, 
R78401, R80802, HO2840, H03734, H24549, 
H26291, H26447,H27912, H43630, H47817, 
R83903, R83904, R94147, H49533, H49773, 
H50716, H50820, H87446, H87553, H93471, 
H93472, H98814, N22867, N32I37, N32762, 
N34334, N35009, N36932, N43763, N46205, 
N5225 1 , N56805, N72290, N95794, W027 13, 
W02886. W17176, W24905, W25571 , VV25688, 
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W67795, W72687, W72962, W77793, W79704. 
W81376. W8630L W86316, AA025519, 
AA025959, AA026653, AA029556, AA029704, 
AA079472, A A 1 2 1 306, A A 1 36679, AA 1 4868 1 , 
AA148680. AA 18 1745, AA425923 


693589 


Preferably excluded from the present 
invpntinn are one dv more 

11 1 V ^1 1 VI V^l 1 (11 V WllV \Jl lllWIV 

polynucleotides comprising a nucleotide 
sequence described by the general 
formula of a-b. where a is any integer 
between 1 to 404 of SEQ ID NO:64, b is 
an integer of 1 5 to 4 1 8, where both a 
and b correspond to the positions of 
nucleotide residues shown in SEQ ID 
NO:64. and where b is greater than or 
equal to a + 14. 




694991 


Preferably excluded from the present 
invention are one or more 
polynucleotides comprising a nucleotide 
sequence described by the general 
formula of a-b, where a is any integer 
between I to 2822 of SEQ ID NO:65, b 
is an integer of 15 to 2836, where both a 
and b correspond to the positions of 
nucleotide residues shown in SEQ ID 
NO:65, and where b is greater than or 
equal to a + 14. 




698303 


Preferably excluded from the present 
invpnfinn are nnp or morp 

1 11 V 11 luii ai v> vjiiw \ji iiiv/it-> 

polynucleotides comprising a nucleotide 
sequence described by the general 
formula of a-b, where a is any integer 
between 1 to 2291 of SEQ ID NO:66, b 
is an integer of 1 5 to 2305, where both a 
and b correspond to the positions of 
nucleotide residues shown in SEQ ID 
C^O:66, and where b is greater than or 
equal to a + 14. 


183582, T844 1 7, T85606, R66380, R67 1 1 1 , 
R76298 H96019 H96020 N2S6SQ N2^6fil 
N34260, N34263, N70618, W05500, W 15421, 
W23670, W39659, AA015855, AA033569, 
AA033570, AA044566. AA044583, AA 178933, 
AA 179025 


698669 


Preferably excluded from the present 
invention are one or more 
polynucleotides comprising a nucleotide 
sequence described by the general 
formula of a-b, where a is any integer 
between I to 1893 of SEQ lDNO:67, b 
is an integer of 15 to 1907, where both a 
and b correspond to the positions of 
nucleotide residues shown in SEQ ID 
NO: 67, and where b is greater than or 
equal to a+ 14. 


T471 15, T471 16, R48786, R48893, R55495, 
R71847, R78934, R79033, R82776, H26587, 
H27077, R97760, H59232, H791 15, H791 16, 
N22948, N23658. N26858, N28757, N39967, 
N71599, W24648, W60157, W67490, W67491, 
W67815, W72921, W942I5, AA009634, 
AA026899, AA026900, AA029244, AA029040, 
AA031846, AA031847, AA032073, AA034285, 
AA034992, AA036865, AA037006, AA040908, 
AA039990, AA040521, AA040522, AA040773, 
AA043726, AA044071, AA044182, AA042948, 
AA043067, AA046606, AA046721, AA062914, 
AA074334, AA076039. AA076203, AA079763, 
AA079764. AA082550. AA085926, AA099318, 
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AA099836, AA102385, AA101039, AA101040, 
AA 11 257 K AA 1 1 2572, AA 1 14828, AA 1 1495 1 , 
AA 1 2800 K AA 1 28082, A A 1 26986, AA 1 28 1 34, 
AA 1 28459, A A 1 299 1 0, AA 1 3 1 403, A A 1 3 1 503, 
A A 1 47437, AA 1 47438 , AA 1 5096 1 , AA 1 5 1 05 1 , 
AA156785, AA156855, AA157912, AA157913, 
AA158544, AA158545, AA158554, AA158553, 
AA21 1822, AA460840, AA461 144 


705696 


Preferably excluded from the present 
invention are one or more 
polynucleotides comprising a nucleotide 
sequence described by the general 
formula of a-b, where a is any integer 
between I to 801 of SEQ ID NO:68, b is 
an integer of 15 to 815. where both a 
and b correspond to the positions of 
nucleotide residues shown in SEQ ID 
NO:68, and where b is greater than or 
equal toa+ 14. 


H20141, H20156, H20236, H20250, H49965, 
H50007, H50487, W92252, AA0451 16, 
AA134141, AA142968 


706393 


Preferably excluded from the present 
invention are one or more 
polynucleotides comprising a nucleotide 
sequence described by the general 
formula of a-b, where a is any integer 
between 1 to 1 136 of SEQ ID NO:69, b 
is an integer of 15 to 1 150, where both a 
and b correspond to the positions of 
nucleotide residues shown in SEQ ID 
NO:69, and where b is greater than or 
equal to a + 14. 


T48975, T51242, T51357, T59673, T59807, 
T62725, T62875, T72330, T97577, ROl 168, 
R21893, R22365, R35745, R41863, R41863, 
R63676, R6588I, R72862, R73334, R75659, 
R75767, H02871, H03430, H03512, H14924, 
H23660, H30020, H30277, H39675, H40069, 
H40278, H40526, H41667, H41700, H43170, 
H43670, H45130, H45I72. H45173, H45433, 
H46542 H46952 H46953 H62390 H78695 
H78777, H84781, H85405, H92309, N20534, 
N33402, N38945, N57790, N57945, N59752, 
W94488, W94489, AA044423, AA043057, 
AA081370, AA081371, AA099447, AAl 12623, 
AA 1 12622, AAl 43 199, AAl 43 2 14, AA 149467, 
AA149553, AA157049, AA157201, AA157952, 
AA157953, AA158049, AA158435, AA158837, 
AA158841, AA161074, AA161078, AA180395, 
AA251447, AA419021, AA428783, AA429093 


707357 


Preferably excluded from the present 
invention are one or more 
polynucleotides comprising a nucleotide 
sequence described by the general 
formula of a-b, where a is any integer 
between 1 to 330 of SEQ ID NO:70, b is 
an integer of 15 to 344, where both a 
and b correspond to the positions of 
lucleotide residues shown in SEQ ID 
^IO:70, and where b is greater than or 
iqual to a + 14. 




707360 

I 
< 


Preferably excluded from the present 
nvention are one or more 
Dolynucleotides comprising a nucleotide 
sequence described by the general 
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formula of a-b, where a is any integer 
between 1 to 434 of SEQ ID N0:7L b is 
an integer of 1 5 to 448. where both a 
and b conespond to the positions of 
nucleotide residues shown in SEQ ID 
N0:7 1 , and where b is greater than or 
equal to a + 14. 




707375 


Preferably excluded from the present 
invention are one or more 
polynucleotides comprising a nucleotide 
sequence described by the general 
formula of a-b, where a is any integer 
between 1 to 28 1 1 of SEQ ID NO:72, b 
is an integer of 15 to 2825, where both a 
and b correspond to the positions of 
nucleotide residues shown in SEQ ID 
NO:72, and where b is greater than or 
equal to a + 14. 


154 138, T65139, T65330, T80324, T83140, 
R00512. R00612. R19513, R31469, R31470, 
R47795, R77921, R78022, R800I2, H02327, 
H02429, H06404, H06405, H08607, H08608, 
H 1 4264, H 1 837o! H 1 9266,' H 1 9267^ H2 1 399! 
1-12 147 1. H47094, H47185, R85467, R87496, 
R8750L R8758L RS8189, R88226, R88227, 
N23376, N32357, N58463, N66212. N93661, 
N99103. W19083, W24383, W6860L W68602, 
W68723, W68745, AA016I49, AA040296, 
AA056973. AA135439, AA135519, AA135580, 
AA135856. AA158858, AA161 122, AA226730, 
AA226764, AA22747I, AA227481, AA232259 


707754 


Preferably excluded from the present 
invention are one or more 
polynucleotides comprising a nucleotide 
sequence described by the general 
formula of a-b, where a is any integer 
between 1 to 496 of SEQ ID NO:73, b is 
an integer of 1 5 to 5 1 0. where both a 
and b correspond to the positions of 
nucleotide residues shown in SEQ ID 
NO: 73, and where b is greater than or 
equal to a + 14. 




711172 


Preferably excluded from the present 
invention are one or more 
polynucleotides comprising a nucleotide 
sequence described by the general 
formula of a-b, where a is any integer 
between 1 to 444 of SEQ ID NO:74, b is 
an integer of 15 to 458, where both a 
and b correspond to the positions of 
nucleotide residues shown in SEQ ID 
SJO:74, and where b is greater than or 
equal to a + 14. 




712248 


Preferably excluded from the present 
invention are one or more 
polynucleotides comprising a nucleotide 
sequence described by the general 
formula of a-b, where a is any integer 
between 1 to 363 of SEQ ID NO:75, b is 
an integer of 15 to 377, where both a 
and b correspond to the positions of 
nucleotide residues shown in SEQ ID 
NO:75, and where b is .greater than or 
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equal to a + 14. 




715445 


Preferably excluded from the present 
invention are one or more 
polynucleotides comprising a nucleotide 
sequence described by the general 
formula of a-b, where a is any integer 
between I to 2056 of SEQ ID NO:76, b 
is an integer of 15 to 2070, where both a 
and b correspond to the positions of 
nucleotide residues shown in SEQ ID 
NO: 76. and where b is greater than or 
equal to a + 14. 


TS8778. T97557, T97604, R17I89, R27615, 
R30849, R4I740. R486I6, R41 740, H 12351, 
R93768. R98882. R98972, H59983, N23156, 
N32736, N34539, N55086, N62785, N67224, 
N77297, N78823, N79734, W07252, W90651, 
AA037793, AA037794, AA055196, AA055286, 
AAi 13425, AA233917, AA234I65, AA258602, 
AA258548, AA42658I, AA429080 


716362 


Preferably excluded from the present 
invention are one or more 
polynucleotides comprising a nucleotide 
sequence described by the general 
foimula of a-b, where a is any integer 
between 1 to 983 of SEQ ID NO:77, b is 
an integer of 1 5 to 997, where both a 
and b correspond to the positions of 
nucleotide residues shown in SEQ ID 
NO:77, and where b is greater than or 
equal to a + 14. 




716835 


Preferably excluded from the present 
invention are one or more 
polynucleotides comprising a nucleotide 
sequence described by the general 
formula of a-b, where a is any integer 
between 1 to 1319 of SEQ IDNO:78,b 
is an integer of 1 5 to 1 333, where both a 
and b correspond to the positions of 
nucleotide residues shown in SEQ ID 
NO:78, and where b is greater than or 
equal to a + 14. 




716947 


Preferably excluded from the present 
invention are one or more 
polynucleotides comprising a nucleotide 
sequence described by the general 
formula of a-b, where a is any integer 
between 1 to 546 of SEQ ID NO:79, b is 
an integer of 15 to 560, where both a 
and b correspond to the positions of 
nucleotide residues shown in SEQ ID 
>JO:79, and where b is greater than or 
equal to a + 14. 




717685 


Preferably excluded from the present 
invention are one or more 
polynucleotides comprising a nucleotide 
sequence described by the general 
formula of a-b, where a is any integer 
between 1 to 3 1 89 of SEQ ID NO:80, b 
s an integer of 15 to 3203, where both a 


T54040, N35800, W45d88, AAI22232, 

A A 1 2 II 09, A A 1 26030, A A 1 26 1 52, A A 1 556 1 8, 

AAI 55656 ■ 
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and b correspond to the positions of 
nucleotide residues shown in SEQ ID 
NO;80, and where b is greater than or 
equal to a + 14. 




719755 


Preferably excluded from the present 
invention are one or more 
polynucleotides comprising a nucleotide 
sequence described by the general 
formula of a-b. where a is any integer 
between 1 to 1696 of SEQ ID N0:8U b 
is an integer of 1 5 to 1710, where both a 
and b correspond to the positions of 
nucleotide residues shown in SEQ ID 
N0:8 1 , and where b is greater than or 
equal to a + 14. 




720389 


Preferably excluded from the present 
invention are one or more 
polynucleotides comprising a nucleotide 
sequence described by the general 
formula of a-b. where a is any integer 
between I to 1365 of SEQ ID NO:82, b 
is an integer of 1 5 to 1 379, where both a 
and b correspond to the positions of 
nucleotide residues shown in SEQ ID 
NO:82, and where b is greater than or 
equal to a + 14. 




720903 


Preferably excluded from the present 
invention are one or more 
polynucleotides comprising a nucleotide 
sequence described by the general 
formula of a-b, where a is any integer 
between 1 to 664 of SEQ ID NO:83, b is 
an integer of 15 to 678, where both a 
and b correspond to the positions of 
nucleotide residues shown in SEQ ID 
NO: 83, and where b is greater than or 
equal to a+ 14. 




721348 


Preferably excluded from the present 
invention are one or more 
polynucleotides comprising a nucleotide 
sequence described by the general 
formula of a-b, where a is any integer 
between 1 to 2789 of SEQ ID NO:84, b 
is an integer of 15 to 2803, where both a 
and b correspond to the positions of 
nucleotide residues shown in SEQ ID 
^0:84, and where b is greater than or 
squal to a + 14. 




721562 


Preferably excluded from the present 
nvention are one or more 
Dolynucleotides comprising a nucleotide 
sequence described by the general 





wo 00/55173 



PCT/USOO/05881 



95 





formula of a-b. where a is any integer 
between 1 to 1264 of SEQ ID NO:85, b 
is an integer of 15 to 1278, where both a 
and b correspond to the positions of 
nucleotide residues shown in SEQ ID 
NO:85, and where b is greater than or 
equal to a +. 14. 




722775 


Preferably excluded from the present 
invention are one or more 
polynucleotides comprising a nucleotide 
sequence described by the general 
formula of a-b, where a is any integer 
between 1 to 2571 of SEQ ID NO:86, b 
is an integer of 15 to 2585, where both a 
and b correspond to the positions of 
nucleotide residues shown in SEQ ID 
NO:86. and where b is greater than or 
equal to a + 14. 




724463 


Preferably excluded from the present 
invention are one or more 
polynucleotides comprising a nucleotide 
sequence described by the general 
formula of a-b, where a is any integer 
between 1 to 371 of SEQ ID NO:87, b is 
an integer of 15 to 385, where both a 
and b correspond to the positions of 
nucleotide residues shown in SEQ ID 
NO:87, and where b is greater than or 
equal to a + 14. 




727501 


Preferably excluded from the present 
invention are one or more 
polynucleotides comprising a nucleotide 
sequence described by the general 
formula of a-b, where a is any integer 
between 1 to 2486 of SEQ ID NO:88, b 
is an integer of 1 5 to 2500, where both a 
and b correspond to the positions of 
nucleotide residues shown in SEQ ID 
NO:88, and where b is greater than or 
equal to a + 14. 




728418 


Preferably excluded from the present 
invention are one or more 
polynucleotides comprising a nucleotide 
sequence described by the general 
formula of a-b, where a is any integer 
between I to 1395 of SEQ IDNO:89, b 
is an integer of 15 to 1409, where both a 
and b correspond to the positions of 
nucleotide residues shown in SEQ ID 
t^IO:89, and where b is greater than or 
equal to a + 14. 




728920 


Preferably excluded from the present 
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invention are one oi' more 
polynucleotides comprising a nucleotide 
sequence described by the general 
formula of a-b, where a is any integer 
between 1 to 1322 of SEQ ID NO:90, b 
is an integer of 15 to 1336, where both a 
and b correspond to the positions of 
nucleotide residues shown in SEQ ID 
NO:90, and where b is greater than or 
equal to a + 14. 




732958 


Preferably excluded from the present 
invention are one or more 
polynucleotides comprising a nucleotide 
sequence described by the general 
formula of a-b, where a is any integer 
between I to 773 of SEQ ID N0:9 1 , b is 
an integer of 15 to 787, where both a 
and b correspond to the positions of 
nucleotide residues shown in SEQ ID 
N0:91, and where b is greater than or 
equal to a + 14. 




733134 


Preferably excluded from the present 
invention are one or more 
polynucleotides comprising a nucleotide 
sequence described by the general 
formula of a-b, where a is any integer 
between 1 to 1643 of SEQ ID NO:92, b 
is an integer of 15 to 1657, where both a 
and b correspond to the positions of 
nucleotide residues shown in SEQ ID 
NO:92, and where b is greater than or 
equal to a + 14. 


T49547, T49558, T49559, T49560, T49561, 
T49649, T49650, T70062, T70129, T75532, 
T95137, R17573, T27052, R19790, R42912, 
R52618, R53272, R42912, R59922, R59923, 
R65930, H08841, H08925, H47546, H47547, 
H47774, H47784, H481 19. H64949, H64950, 
H69959, H69960, H80517, H80569, H81281, 
H81337, H87618, H87619, H88959, H89042, 
H95657, H95712, H95729, H88959, H98860, 
N20108, N23582,N27446, N34733, N49675, 
N5 1 84 1 , N755 1 7, N78965, N93975, W053 1 0, 
W17334, W40344, W52084, W52929, W72818, 
W72819, W86046, W92307, W92294, 
AA009783, AA009892, AA022930, AA022980, 
AA024699, AA024734, AA037408, AA045887, 
AA045888, AA062821, AA081026, AA082088, 
A AO 82420, A A 1 0280 1 , AA 1 9986 1 , AA 1 9993 1 , 
AA220961, AA223217, AA223456, AA224153, 
AA224177, AA224137, AA224I38, AA224341, 
AA232349, AA232533, AA2321 17, AA458900, 
AA459095, AA463299 


734099 


Preferably excluded from the present 
invention are one or more 
polynucleotides comprising a nucleotide 
sequence described by the general 
formula of a-b, where a is any integer 
between 1 to 471 of SEQ ID NO:93, b is 
an integer of 15 to 485, where both a 
and b correspond to the positions of 
nucleotide residues shown in SEQ ID 
NO:93, and where b is greater than or 


R22895, H87448 
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equal to a + 14, 




734599 


Preferably excluded from the present 
invention are one or more 
polynucleotides comprising a nucleotide 
sequence described by the general 
fonnula of a-b, where a is any integer 
between 1 to 750 of SEQ ID NO:94, b is 
an integer of 1 5 to 764. where both a 
and b coiTespond to the positions of 
nucleotide residues shown in SEQ ID 
NO:94, and where b is greater than or 
equal to a + 14. 




736019 


Preferably excluded from the present 
invention are one or more 
polynucleotides comprising a nucleotide 
sequence described by the general 
formula of a-b, where a is any integer 
between 1 to 693 of SEQ ID NO:95, b is 
an integer of 15 to 707, where both a 
and b correspond to the positions of 
nucleotide residues shown in SEQ ID 
NO:95, and where b is greater than or 
equal to a + 14. 


T41219, T50359, T56829, T58426, T58458, 
T60928, T60984, T64158, T64287, R27157, 
H03484, H03579, H22546, H22547, H28310, 
H44067. H44146, R83796, H48481, H48645, 
H57243, H66I62, H66I63, H82370, N21 1 10, 
N21 188, N2746K N29155, N29743, N31 124, 
N32398, N39884, N56818,N57165,N57228, 
N57403, N68904, N73978, N77833, N93027, 
N93818. N671 12, W00894, W00923, W02234, 
W 16676, W21379, W44969, AA064843, 
AA070697, AA070876, AA071332, AA071265, 
AA076379, AAO76308, AA079524, AA079572, 
AA08 1231, AA08 140 1 , A A083774, AA083775, 
AA130308, AA 130309, AA132056, AA132160, 
AA143I32, AA146882, AA146883, AA165057, 
AA164722, AA166939, AA181 133, AA187371, 
AA 187804, AA 188 1 18, AA 186447, AA 186448, 
AA 1 87 1 05, AA 1 87 1 50, AA 1 88273 


738268 


Preferably excluded from the present 
invention are one or more 
polynucleotides comprismg a nucleotide 
sequence described by the general 
fonnula of a-b, where a is any integer 
between 1 to 801 of SEQ ID NO:96, b is 
an integer of 15 to 8 15, where both a 
and b correspond to the positions of 
nucleotide residues shown in SEQ ID 
^0:96y and where b is greater than or 
equal to a + 14. 


T48287, T48288, T54477, T545 1 1 , R34064, 
R36907, R49496, R49496, R75625, R75724, 
H12225, H16384, H19466, H19543, H42166, 
H42988, H54780, H99297, N22733, N26471, 
N74933, N93468, W15461, W47542, W47590, 
N90997, AA010700, AAO 10701, AA056728, 
AA088699, AAI26219, AA132934, AA156291, 
AA165516, AA165558, AA 1 76293, AA 173448, 
AA 189056, AA2335 1 5, A A45983 1 , AA46001 1 


738911 


Preferably excluded from the present 
invention are one or more 
polynucleotides comprising a nucleotide 
sequence described by the general 
ibrmula of a-b, where a is any integer 
between 1 to 644 of SEQ ID NO:97, b is 
an integer of 15 to 658, where both a 
and b correspond to the positions of 
nucleotide residues shown in SEQ ID 
^10:97, and where b is greater than or 
squal to a + 14. 


H22593, H52836 
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739226 


Preferably excluded from the present 
invention are one or more 
polynucleotides comprising a nucleotide 
sequence described by the general 
formula of a-b. where a is any integer 
between 1 to 235 of SEQ ID NO:98, b is 
an integer of 15 to 249, where both a 
and b coiTespond to the positions of 
nucleotide residues shown in SEQ ID 
NO:98, and where b is greater than or 
equal to a + 14, 


T57824,N63155. AA027845 


739527 


Preferably excluded from the present 
invention are one or more 
polynucleotides comprising a nucleotide 
sequence described by the general 
formula of a-b, where a is any integer 
between 1 to 738 of SEQ ID NO:99, b is 
an integer of 15 to 752. where both a 
and b correspond to the positions of 
nucleotide residues shown in SEQ ID 
NO:99, and where b is greater than or 
equal to a + 14. 




740710 


Preferably excluded from the present 
invention are one or more 
polynucleotides comprising a nucleotide 
sequence described by the general 
formula of a-b, where a is any integer 
between I to 3045 of SEQ ID NO: 100, 
b is an integer of 15 to 3059, where both 
a and b correspond to the positions of 
nucleotide residues shown in SEQ ID 
NO: 1 00, and where b is greater than or 
equal to a+ 14. 




742980 


Preferably excluded from the present 
invention are one or more 
polynucleotides comprising a nucleotide 
sequence described by the general 
formula of a-b, where a is any integer 
between 1 to 1668 of SEQ ID NO:101, 
b is an integer of 15 to 1682, where both 
a and b correspond to the positions of 
nucleotide residues shown in SEQ ID 
NO: 101, and where b is greater than or 
equal to a + 14. 


T71993, R1290K R40053, H14591, H14696, 
R83485, H50584, H50585, H89958, H89966, 
H89973, H89980, N26005, N34777, N36638, 
N36637, N44503, N67682, N7612I, N79613, 
W03491, W05571, W31276, W49653, W49727, 
AA009708, AA009798, AA035612, AA042894, 
AA043030, AA062953, AAl 15370, AA133278, 
AA 1 8 1 268, AA 1 8 1 269, A A 1 93206 


744331 


Preferably excluded from the present 
invention are one or more 
polynucleotides comprising a nucleotide 
sequence described by the general 
formula of a-b, where a is any integer 
between 1 to 924 of SEQ ID NO:l02,b 
is an integer of 15 to 938, where both a 
and b correspond to the positions of 


R25354, R49789, R71735, R71740, H73502, 
H79224, H87423, H99515, H99516, N2475K 
N32707, N4451 K N52325, N67764, N75095, 
N93879, W40372. W69127, W69094, W74698, 
W74736, AA026984, AA035176, AAl 49088, 
AA262739, AA464357, AA430724 
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nucleotide residues shown in SEQ ID 
NO: 102. and where b is greater than or 
equal to a + 14. 




74475 1 


Preferably excluded from the present 
invention are one or more 
polynucleotides comprising a nucleotide 
sequence described by the general 
formula of a-b, where a is any integer 
between I to 1998 of SEQ lDNO:"i03, 
b is an integer of 15 to 2012. where both 
a and b correspond to the positions of 
nucleotide residues shown in SEQ ID 
NO; 103. and where b is greater than or 
equal to a + 14. 




745750 


Preferably excluded from the present 
invention are one or more 
polynucleotides comprising a nucleotide 
sequence described by the general 
formula of a-b, where a is any integer 
between I to 1080 of SEQ ID NO:l04, 
b is an integer of 15 to 1094, where both 
a and b correspond to the positions of 
nucleotide residues shown in SEQ ID 
NO: 104, and where b is greater than or 
equal to a -i- 14, 




746285 


Preferably excluded from the present 
invention are one or more 
polynucleotides comprising a nucleotide 
sequence described by the general 
formula of a-b, where a is any integer 
between 1 to 2283 of SEQ ID NO: 105, 
b is an integer of 15 to 2297, where both 
a and b correspond to the positions of 
nucleotide residues shown in SEQ ID 
NO: 105, and where b is greater than or 
equal to a + 14. 


T87719, T87928, R99975, R99976, H647I4, 
H65205, H92423, H65205, N47296, N48612, 
N58085, N58926, N64294, N64508, N72401, 
N80294, N93405, W04791, W21447, W94582, 
W95317, AA024856, AA024939, AA037672, 
AA037673, AA070416, AA075508, AA075507, 
AA101263, AA148029, AA147953, AAI69726, 
AA 17 1461, AA 173095. AA464821 


746416 


Preferably excluded from the present 
invention are one or more 
polynucleotides comprising a nucleotide 
sequence described by the general 
formula of a-b, where a is any integer 
between 1 to 428 of SEQ ID NO: 106, b 
is an integer of 15 to 442, where both a 
and b correspond to the positions of 
nucleotide residues shown in SEQ ID 
NO: 1 06, and where b is greater than or 
squal to a + 14. 




747851 


Preferably excluded from the present 
nvention are one or more 
polynucleotides comprising a nucleotide 
sequence described by the general 
brmula of a-b, where a is any integer 


N44767, W44754 
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between I to 1005 of SEQ LD NO: 107, 
b is an integer of 15 to 1019, where both 
a and b correspond to the positions of 
nucleotide residues shown in SEQ ID 
NO: 1 07, and where b is greater than or 
equal to a + 14. 




750632 


Preferably excluded from the present 
invention are one or more 
polynucleotides comprising a nucleotide 
sequence described by the general 
formula of a-b, where a is any integer 
between 1 to 697 of SEQ ID NO: 108, b 
is an integer of 15 to 7 1 1 , where both a 
and b coirespond to the positions of 
nucleotide residues shown in SEQ ID 
NO: 1 08» and where b is greater than or 
equal to a + 14. 


H48882, W23677, W351 10, AA 133857 


751315 


Preferably excluded from the present 
invention are one or more 
polynucleotides comprising a nucleotide 
sequence described by the general 
formula of a-b, where a is any integer 
between 1 to 729 of SEQ ID NO: 1 09, b 
is an integer of 1 5 to 743, where both a 
and b correspond to the positions of 
nucleotide residues shown in SEQ ID 
NO: 109, and where b is greater than or 
equal to a + 14. 




754009 


Preferably excluded from the present 
invention are one or more 
polynucleotides comprising a nucleotide 
sequence described by the general 
formula of a-b, where a is any integer 
between I to 781 of SEQ ID NO: 1 10, b 
is an integer of 15 to 795, where both a 
and b correspond to the positions of 
nucleotide residues shown in SEQ ID 
NO: 1 10, and where b is greater than or 
equal to a + 14. 




754634 


Preferably excluded from the present 
invention are one or more 
polynucleotides comprising a nucleotide 
sequence described by the general 
formula of a-b, where a is any integer 
between 1 to 1318 of SEQ IDN0:11U 
3 is an integer of 1 5 to 1 332, where both 
a and b correspond to the positions of 
nucleotide residues shown in SEQ ID 
MO: 111, and where b is greater than or 
;qual to a + 14. 


N21429 


756637 


Preferably excluded from the present 
nvention are one or more 


M44651, W76461 
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polynucleotides comprising a nucleotide 
sequence described by the general 
formula of a-b. where a is any integer 
between 1 to 729 of SEQ ID NO: ll 2, b 
is an integer of 15 to 743, where both a 
and b correspond to the positions of 
nucleotide residues shown in SEQ ID 
NO: 1 12, and where b is greater than or 
equal to a + 14. 




756833 


Preferably excluded from the present 
invention are one or more 
polynucleotides comprising a nucleotide 
sequence described by the general 
formula of a-b, where a is any integer 
between 1 to 1676 of SEQ ID NO: n3, 
b is an integer of 15 to 1690, where both 
a and b correspond to the positions of 
nucleotide residues shown in SEQ ID 
NO: 113, and where b is greater than or 
equal to a + 14. 




756878 


Preferably excluded from the present 
invention are one or more 
polynucleotides comprising a nucleotide 
sequence described by the general 
formula of a-b, where a is any integer 
between 1 to 606 of SEQ ID N0:1 H, b 
is an integer of 15 to 620, where both a 
and b correspond to the positions of 
nucleotide residues shown in SEQ ID 
NO: 1 14, and where b is greater than or 
equal to a+ 14. 


R12122 


757332 


Preferably excluded from the present 
invention are one or more 
polynucleotides comprising a nucleotide 
sequence described by the general 
formula of a-b, where a is any integer 
between 1 to 528 of SEQ ID NO: 1 15, b 
is an integer of 1 5 to 542, where both a 
and b correspond to the positions of 
nucleotide residues shown in SEQ ID 
N0:1 15, and where b is greater than or 
equal to a + 14. 




760835 


Preferably excluded from the present 
invention are one or more 
polynucleotides comprising a nucleotide 
sequence described by the general 
formula of a-b, where a is any integer 
between 1 to 51 1 of SEQ ID NO: 116. b 
is an integer of 15 to 525, where both a 
and b correspond to the positions of 
nucleotide residues shown in SEQ ID 
HO: 116, and where b is greater than or 
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equal to a + 14. 




761760 


Preferably excluded from the present 
invention are one or more 
polynucleotides comprising a nucleotide 
sequence described by the general 
formula of a-b, where a is any integer 
between 1 to 714 of SEQ iDNOilH, b 
is an integer of 1 5 to 728, where both a 
and b coirespond to the positions of 
nucleotide residues shown in SEQ ID 
NO: 1 17, and where b is greater than or 
equal to a + 14. 




762520 


Preferably excluded from the present 
invention are one or more 
polynucleotides comprising a nucleotide 
sequence described by the general 
formula of a-b, where a is any integer 
between 1 to 934 of SEQ ID NO: H 8, b 
is an integer of 15 to 948, where both a 
and b correspond to the positions of 
nucleotide residues shown in SEQ ID 
NO: 1 1 8, and where b is greater than or 
equal to a + 14. 


T866 1 7 , T866 1 8, R478 1 4. R4996 1 , R7 1 92 1 , 
R71968 H78225 H28275 R94939 R95025 
R97173, R97174, R99726, R99904, H52435. 
H52436, H58879, H58880, H66345, H66395, 
H80709, H80710, W87663, W87664, 
AA046620. AA046867, AA055456, AA 102380, 
AA121314, AA 1 50579, AA 197300 


764461 


Preferably excluded from the present 
invention are one or more 
polynucleotides comprising a nucleotide 
sequence described by the general 
formula of a-b, where a is any integer 
between 1 to 1 97 of SEQ ID NO: 1 1 9, b 
is an integer of 15 to 2 1 1, where both a 
and b correspond to the positions of 
nucleotide residues shown in SEQ ID 
NO: 1 19, and where b is greater than or 
equal to a + 14. 




764517 


Preferably excluded from the present 
invention are one or more 
polynucleotides comprising a nucleotide 
sequence described by the general 
formula of a-b, where a is any integer 
between I to 1294 of SEQ ID NO: 120, 
b is an integer of 15 to 1308, where both 
a and b correspond to the positions of 
nucleotide residues shown in SEQ ID 
NO: 120, and where b is greater than or 
equal to a + 14. 




765132 


Preferably excluded from the present 
invention are one or more 
polynucleotides comprising a nucleotide 
sequence described by the general 
formula of a-b, where a is any integer 
between 1 to 2502 of SEQ ID NO: 121, 
5 is an integer of 1 5 to 25 1 6, where both 
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a and b correspond to the positions of 
nucleotide residues shown in SEQ ID 
NO: 121, and where b is greater than or 
equal to a + 14. 




765661 


Preferably excluded from the present 
inventinn 3rf* nnp nr more 
polynucleotides comprising a micleotide 
sequence described by the general 
formula of a-b, where a is any integer 
between I to 1 125 of SEQ IDNO:l22, 
b is an integer of 1 5 to 11 39, where both 
a and b correspond to the positions of 
nucleotide residues shown in SEQ ID 
NO: 1 22, and where b is greater than or 
equal to a + 14. 


T8169I,N27595 


767113 


Preferably excluded from the present 
invention are one or more 
polynucleotides comprising a nucleotide 
sequence described by the general 
formula of a-b, where a is any integer 
between 1 to 2 100 of SEQ ID NO:l23, 
b is an integer of 15 to 21 14, where both 
a and b correspond to the positions of 
nucleotide residues shown in SEQ ID 
NO: 123, and where b is greater than or 
equal to a+ 14. 




767204 


Preferably excluded from the present 

iiivdiiiv^il alC UUC KJl IllUlC 

polynucleotides comprising a nucleotide 
sequence described by the general 
formula of a-b, where a is any integer 
between I to 569 of SEQ ID NO: 1 24, b 
is an integer of 15 to 583, where both a 
and b correspond to the positions of 
nucleotide residues shown in SEQ ID 
NO: 124, and where b is greater than or 
equal to a + 14. 




767400 


Preferably excluded from the present 
invention are one or more 
polynucleotides comprising a nucleotide 
sequence described by the general 
formula of a-b, where a is any integer 
between I to 1973 of SEQ ID NO: 125, 
b is an integer of 15 to 1987, where both 
a and b correspond to the positions of 
nucleotide residues shown in SEQ ID 
NO: 1 25, and where b is greater than or 
equal to a + 14. 




767962 


Preferably excluded from the present 
nvention are one or more 
polynucleotides comprising a nucleotide 
sequence described by the general 


T59753, R2I255, R21256, R23274, R23364, 
R71913, R7 1956, HI 2633, HI 2686, H99087, 
N26954, N335I8, N43798, N62998, N66835, 
N7 1 1 24, N7 1 1 56, N74 144, N79907, WO 1 554, 
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formula of a-b, where a is any integer 
between 1 to 1437 of SEQ ID NO:l26, 
b is an integer of 1 5 to 1 45 K where both 
a and b correspond to the positions of 
nucleotide residues shown in SEQ ID 
NO: 126, and where b is greater than or 
equal to a + 14. 


W05537, W19994, W44368, W46357, W46193, 
VV47163. \V47284, W52537, W55854, W80804, 
WS0878, W92021, W92022, N90420. 
AA002178, AA022578, AA022579, AA029899, 
AA029987, AA03418L AA036856, AA036913, 
AA043237, AA043566, AA071518, AA082340, 
AA 122 159, AA 120962. AA 146944, A A 147449, 
A A 1 4808 L AA 1 5 1 266, AA 1 5 1 267, AA 1 56459 


768040 


Preferably excluded from the present 
invention are one or more 
polynucleotides comprising a nucleotide 
sequence described by the general 
formula of a-b, where a is any integer 
between 1 to 1220 of SEQ ID NOM27, 
b is an integer of 15 to 1234, where both 
a and b correspond to the positions of 
nucleotide residues shown in SEQ ID 
NO: 127, and where b is greater than or 
equal to a + 14. 




769956 


Preferably excluded from the present 
invention are one or more 
polynucleotides comprising a nucleotide 
sequence described by the general 
formula of a-b, where a is any integer 
between 1 to 849 of SEQ ID NO: 128, b 
is an integer of 1 5 to 863, where both a 
and b correspond to the positions of 
nucleotide residues shown in SEQ ID 
NO: 128, and where b is greater than or 
equal to a + 14. 


R68817, R68925, R75906, H14626, H82146, 
H93109, H93237, N32098, N35721, N45410, 
N75570, W03043, W04850, AA029607, 
AA26286K AA463956, AA464092 


770133 


Preferably excluded from the present 
invention are one or more 
polynucleotides comprising a nucleotide 
sequence described by the general 
formula of a-b, where a is any integer 
between 1 to 1224 of SEQ ID NO: 129, 
b is an integer of 15 to 1238, where both 
a and b correspond to the positions of 
nucleotide residues shown in SEQ ID 
[^0:129, and where b is greater than or 
equal to a+ 14. 




770289 


Preferably excluded from the present 
invention are one or more 
polynucleotides comprising a nucleotide 
sequence described by the general 
formula of a-b, where a is any integer 
between 1 to 365 of SEQ ID NO: 130, b 
IS an integer of 15 to 379, where both a 
and b correspond to the positions of 
nucleotide residues shown in SEQ ID 
^0: 130, and where b is greater than or 
2quai toa+ 14. 
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Preferably excluded from the present 
invention are one or more 
polynucleotides comprising a nucleotide 
sequence described by the general 
formula of a-b, where a is any integer 
between 1 to 1772 ofSEQ lDNO:nK 
b is an integer of 15 to 1786, where both 
a and b correspond to the positions of 
nucleotide residues shown in SEQ ID 
NO: 131, and where b is greater than or 
equal to a -r 14. 


T53984. T55243, T51230, T77632, T91326. 
T808 1 9, T8 1219. T84909, T95454, T97320, 
T99226. T99269, R16575, R16634, R 19765, 
R229S7, R23096, R33095, R33188, R37437, 
R39255, R45185, R45185, R62594, R62642. 
H0389L H03892, H08679, H08680, H20556, 
H20650. H46154, H46155, R88298, R90733, 
R90759. R92224. R92332, R97325, H57663, 
H58503. H61709, H61913, H62747, H66685, 
H68924, H68954, H80053, H83342, H95786, 
H96135, N20464, N20472, N24026, N25491, 
N35235, N35419, N38769, N44900, N48399, 
N53 146, N55089, N55095, N57767, N58580, 
N59732, N63942, N70290, N7 1 759, N74938, 
N77'i00 NI0R4I1 W^lMf) Wf^l^/^O 

W56557. W56635, W56598, W56594, W73408, 
W74230, W79843, W93916, AA031492, 
AA070868, AA071019, AA088788, AA100685, 
AAl 12926, AAI76829, AA176851, AA193034, 
AAl 94065, AAl 94 180, AAl 94579, AAl 94703, 
AAl 954 1 6, AAl 95532, AA233792, AA233783, 
AA233900, AA233920, AA234128, AA234169, 
AA252704, A A25283 1 , AA4 1 6743, AA4 18391, 
AA4 18440 


772582 


Preferably excluded from the present 

invpntinn J*rf* nnf* or mnr'P 

i 1 1 V ^llll V/ll Ml & Ull^ \JL iH\Jk\i 

polynucleotides comprising a nucleotide 
sequence described by the general 
formula of a-b, where a is any integer 
between i to 960 of SEQ ID NO: 132, b 
is an integer of 15 to 974, where both a 
and b correspond to the positions of 
nucleotide residues shown in SEQ ID 
f^O: 132, and where b is greater than or 
equal to a + 14. 




773387 


Preferably excluded from the present 
invention are one or more 
polynucleotides comprising a nucleotide 
sequence described by the general 
formula of a-b, where a is any integer 
between 1 to 620 of SEQ ID NO: 1 33, b 
is an integer of 1 5 to 634, where both a 
and b correspond to the positions of 
nucleotide residues shown in SEQ ID 
NO: 1 33, and where b is greater than or 
equal to a+ 14. 




773827 

1 


Preferably excluded from the present 
invention are one or more 
polynucleotides comprising a nucleotide 
sequence described by the general 
brmula of a-b, where a is any integer 
between 1 to 1841 of SEQ ID NO: 1 34, 
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b is an integer of 15 to 1855, where both 
a and b correspond to the positions of 
nucleotide residues shown in SEQ ID 
NO; 134, and where b is greater than or 
equal to a + 14. 




774108 


Preferably excluded from the present 
invention are one or more 
polynucleotides comprising a nucleotide 
sequence described by the general 
formula of a-b, where a is any integer 
between 1 to 903 of SEQ ID NO: 135, b 
is an integer of 15 to 91 7, where both a 
and b coixespond to the positions of 
nucleotide residues shown in SEQ ID 
NO: 1 35, and where b is greater than or 
equal to a + 14. 


T96288, R3 1 388, R32886, R63543, R63597, 
R758 1 K R758 1 2, H20285, H20509, H20599, 
H21238, H24872, H29854, H29945, H41103, 
H4I208, H44188, H44189. R85628, R91367, 
H83459, H8357L H97165. H97164, N25639, 
N29652, N29777, N32407, N324 1 3, N32580, 
N32835! N4i9l8, N42281, N56607, N57152, 
N57196, N69818, N70613, N93340, N93928, 
N94454, W24358, W25163, W30800, W37904, 
W37964, W40428, W6863 U W68632, W70339, 
W80994, W81096, W81716, W81253, W81543, 
W81544, W94206, AA004372, AAOl 1346, 
AA016002, AA028888, AA029626, AA029627, 
AA044028, AA044350, AA062804, AA081035, 
AA13I270, AA131354. AA131371 


774636 


Preferably excluded from the present 
invention are one or more 
polynucleotides comprising a nucleotide 
sequence described by the general 
formula of a-b, where a is any integer 
between 1 to 1257 of SEQ ID NO: 136, 
b is an integer of 1 5 to 127L where both 
a and b correspond to the positions of 
nucleotide residues shown in SEQ ID 
NO: 136, and where b is greater than or 
equal to a + 14. 


T54747, T69827, RI4146, R50592, R55502, 
R736I5. R73937, H41540, R84981, R85103, 
R87495, R88553, R88554, R88556, R88818, 
R88839, R89675, R9I235. H51003, H51004, 
H5 1 58 1 , H79057, N70799, W02680, 
AA232327, AA232417, AA464467 


775339 


Preferably excluded from the present 
invention are one or more 
polynucleotides comprising a nucleotide 
sequence described by the general 
formula of a-b, where a is any integer 
between I to 2003 of SEQ ID NO: 137, 
5 is an integer of 15 to 2017, where both 
a and b correspond to the positions of 
nucleotide residues shown in SEQ ID 
NO: 137, and where b is greater than or 
equal to a+ 14. 




775582 


Preferably excluded from the present 
invention are one or more 
polynucleotides comprising a nucleotide 
sequence described by the general 
formula of a-b, where a is any integer 
between 1 to 923 of SEQ ID NO: 1 38, b 
IS an integer of 1 5 to 937, where both a 
and b correspond to -the positions of 
aucleotide residues shown in SEQ ID 
^40: 138, and where b is greater than or 


T62486, T62631, H 14642, R85991, H73603, 
N549 1 2, N68727, N80228. N9 1 6 1 7, W385 1 8, 
W67302, W674 1 8, AA 1 7 1 395, AA2 1 4500, 
AA215291, AA464035 
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equal to a + 14. 




775779 


Preferably excluded from the present 
invention are one or more 
polynucleotides comprising a nucleotide 
sequence described by the general 
formula of a-b, where a is any integer 
between 1 to 2745 of SEQ ID NO:l39, 
b is an integer of 1 5 to 2759, where both 
a and b correspond to the positions of 
nucleotide residues shown in SEQ ID 
NO: 139, and where b is greater than or 
equal to a + 14. 




777809 


Preferably excluded from the present 

invention arp nnp or mnrp 

polynucleotides comprising a nucleotide 
sequence described by the general 
formula of a-b, where a is any integer 
between 1 to 1227 of SEQ ID NOiKO, 
b is an integer of 15 to 1241, where both 
a and b correspond to the positions of 
nucleotide residues shown in SEQ ID 
NO: 140, and where b is greater than or 
equal to a + 14. 




778927 


Preferably excluded from the present 

invention are ont* nr more 
polynucleotides comprising a nucleotide 
sequence described by the general 
formula of a-b, where a is any integer 
between I to 3391 of SEQ ID NO: 141, 
b is an integer of 1 5 to 3405, where both 
a and b correspond to the positions of 
nucleotide residues shown in SEQ ID 
NO: 141, and where b is greater than or 
equal to a + 14. 


150777, T50939, R11800, R 197 13, R3I403, 
R32898 R44269 R44269 R554'il Rfifln41 
R60103, R69554, R74340, R74434, H20427, 
H266I5, H26660, H42495, H43482, R85644, 
H5I488, H68618, N58I57, N58231, N776I I, 
W39692, W45048, W56828, W57633, 
AA052900, AA057808, AA074705, AA 122 120, 
A A 1 2 1 079, A A 1 2 1 23 1 , AA25905 1 , AA464470 


779262 


Preferably excluded from the present 
invention are one or more 
polynucleotides comprising a nucleotide 
sequence described by the general 
formula of a-b, where a is any integer 
between 1 to 2254 of SEQ ID NO: 142, 
b is an integer of 1 5 to 2268, where both 
a and b correspond to the positions of 
nucleotide residues shown in SEQ ID 
NJO: 142, and where b is greater than or 
equal to a + 14. 


Rl 1844, R71241, R7I292, H00159, H88551. 
H90726 H98059 N28770 N58442 N78033 
W3267I, AA035075, AAI 12651, AAl 12652, 
AA130035, AA215309, AA251209 


779392 


Preferably excluded from the present 
invention are one or more 
3oiynucleotides comprising a nucleotide 
sequence described by the general 
formula of a-b, where a is any integer 
between 1 to 1743 of SEQ ID NO: 143, 
? is an integer of 15 to 1757, where both 


R25284, R36255, R36256, R42970, R46635, 
R42970, R46635, H28773, N52867, N70541, 
N77890, W05403, W05783, AA085067, 
AA085066, AA204650, AA210753, AA21 1713, 
AA251462, AA252456, AA460350, AA460780 
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a and b correspond to the positions of 
nucleotide residues shown in SEQ ID 
NO: 143, and where b is greater than or 
equal to a + 14. 




780149 


Preferably excluded from the present 
invention are one or more 
polynucleotides comprising a nucleotide 
sequence described by the general 
formula of a-b, where a is any integer 
between 1 to 1048 of SEQ IDN0:M4, 
b is an integer of 15 to 1062, where both 
a and b correspond to the positions of 
nucleotide residues shown in SEQ ID 
NO: 144, and where b is greater than or 
equal to a + 14. 




780583 


Preferably excluded from the present 
invention are one or more 
polynucleotides comprising a nucleotide 
sequence described by the general 
formula of a-b, where a is any integer 
between 1 to 1016 of SEQ ID NO: H5, 
b is an integer of 15 to 1030, where both 
a and b correspond to the positions of 
nucleotide residues shown in SEQ ID 
NO: 145, and where b is greater than or 
equal to a + 14, 




780960 


Preferably excluded from the present 
invention are one or more 
polynucleotides comprising a nucleotide 
sequence described by the general 
formula of a-b, where a is any integer 
between I to 800 of SEQ ID NO: 146, b 
is an integer of 1 5 to 8 14, where both a 
and b correspond to the positions of 
nucleotide residues shown in SEQ ID 
NO: 146, and where b is greater than or 
equal to a + 14. 




781469 


Preferably excluded from the present 
invention are one or more 
polynucleotides comprising a nucleotide 
sequence described by the general 
i'ormuia of a-b, where a is any integer 
between I to 2664 of SEQ ID NO: 147, 
b is an integer of 15 to 2678, where both 
a and b correspond to the positions of 
nucleotide residues shown in SEQ ID 
W: 147, and where b is greater than or 
equal to a + 14. 


T9579I, H18820, H19074, H22604, H40723, 
H45802, H46056, H47074, H47156, H86819, 
H86886, H88675, H88724, H88972, H89058, 
H88972, N28987, N36053, N39668, N47281, 
W19145, W68543, W68544, N91577, 
AA044679, AA044896, AA43001 1 


781556 


Preferably excluded from the present 
invention are one or more 
polynucleotides comprising a nucleotide 
sequence described by the general 


T94861, T94906, R21516, R26869, R27098, 
R36258, R37965, R37966, R78I72, H03413, 
H041 16, H!4531, H45546, R96826, R98130. 
N51409, N52365, N64272, N74939, N75136, 
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formula of a-b, where a is any integer 
between 1 to lOUofSEQ IDNO:l48, 
b is an integer of 15 to 1028. where both 
a and b correspond to the positions of 
nucleotide residues shown in SEQ ID 
NO: 148, and where b is greater than or 
equal to a + 14. 


VV23556, W35208, AA187823, AAI91525, 
AA429367 


781771 


Preferably excluded from the present 
invention are one or more 
polynucleotides comprising a nucleotide 
sequence described by the general 
formula of a -b, where a is any integer 
between 1 to 141 1 of SEQ ID NO:l49. 
b is an integer of 15 to 1425. where both 
a and b correspond to the positions of 
nucleotide residues shown in SEQ ID 
NO: 149, and where b is greater than or 
equal to a + 14. 


T95420, T99529, R50341; R52125, R72608, 
R72630, R72677, R7270L H26733, H26734. 
H30106, H59788, H8244K N75150, W42750, 
W42840 


782033 


Preferably excluded from the present 
invention are one or more 
polynucleotides comprising a nucleotide 
sequence described by the general 
formula of a-b, where a is any integer 
between 1 to 766 of SEQ ID NO: 150, b 
is an integer of 15 to 780, where both a 
and b correspond to the positions of 
nucleotide residues shown in SEQ ID 
NO: 1 50, and where b is greater than or 
equal to a + 14. 


H53100, H53207, H97410, H98035. N30753, 
N68541, W42491, W4264K W57808, 
AA046603, AA046753, AA 136886, AA 136997, 
AA 1434 1 9, AA 143420 


782105 


Preferably excluded from the present 
invention are one or more 
polynucleotides comprising a nucleotide 
sequence described by the general 
formula of a-b, where a is any integer 
between 1 to 1 052 of SEQ ID NO: 1 5 1 , 
0 is an integer of 15 to 1066, where both 
a and b correspond to the positions of 
nucleotide residues shown in SEQ ID 
NO: 1 5 1 , -and where b is greater than or 
equal to a+ 14. 


R97486, H72940, W90139 


782122 

I 
I 


Preferably excluded from the present 
invention are one or more 
polynucleotides comprising a nucleotide 
sequence described by the general 
formula of a-b, where a is any integer 
3etween 1 to 1635 of SEQ ID NO: 152, 
0 is an integer of 15 to 1649, where both 
\ and b correspond to the positions of 
lucleotide residues shown in SEQ ID 
MO: 152, and where b is greater than or 
jqual to a + 14. 


T54379, T60348, T61029, T54271, T57801, 
R10793, T78907, T78959, R49078, R55635, 
R67844, R67845, R69587, R72600, R72666, 
H04742, H04830, H16978, H24654, H26129, 
H26308, H26395, H26467, H28100, H28205, 
H28252, H28895, H28896, H30485, H39554, 
H42595, H42603, H42662, H43740, H44345, 
H44346, H44546, H44547, H44960, H45012, 
H45860, R88120, R88214, H51204, H58080, 
H58081, H64553, H64654, H70033. H70034, 
^86451, H70034, H99833, N24525, N29867, 
M30752, N35500, N39259. N42463, N44804, 
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N52550, N539S5, N57289, N58726, N63349, 
N67624, N67663. N68 1 57. N70299, N806 15, 
N93230, N94595, N98489, W 19633, W23803, 
W25087, W3I034, W37981, W37982, W42579, 
W44389! W49677, W57614, W5787U W58142, 
W6778K W67840, W68147, W68474, W68699. 
W68791, VV69717, W80749, W80837, N89879, 
AA025233. AA025568, AA025686, AA026026, 
AA033846. AA039625, AA039693, AA046842, 
AA047013. AA057608. AA057676, AA064637, 
AA064680. AA074448, AA083591, AA098837, 
AA102 142, AAl 13374, AAl 13402, AAI 15525, 
AAI 14948, AA128972, AAI28973, AA133142, 
AA 146949. AAl 48086, AA 149283, AA149377, 
A A 1 600 1 2, AA 1 60688, A A 1 72 1 44, AA 1 80932, 
AAl 82561 


783135 


Preferably excluded from the present 
invention are one or more 
polynucleotides comprising a nucleotide 
sequence described by the general 
formula of a-b, where a is any integer 
between 1 to 646 of SEQ ID NO: 153. b 
is an integer of 15 to 660, where both a 
and b correspond to the positions of 
nucleotide residues shown in SEQ ID 
NO: 153, and where b is greater than or 
equal to a + 14. 




783245 


Preferably excluded from the present 
invention are one or more 
polynucleotides comprising a nucleotide 
sequence described by the general 
formula of a-b, where a is any integer 
between 1 to 591 of SEQ ID NO: 154, b 
is an integer of 15 to 605, where both a 
and b correspond to the positions of 
nucleotide residues shown in SEQ ID 
NO: 1 54, and where b is greater than or 
equal to a + 14. 




783247 


Preferably excluded from the present 
invention are one or more 
polynucleotides comprising a nucleotide 
sequence described by the general 
formula of a-b, where a is any integer 
between 1 to 681 of SEQ ID NO; 155, b 
is an integer of 15 to 695, where both a 
and b correspond to the positions of 
nucleotide residues shown in SEQ ID 
SfO: 155, and where b is greater than or 
equal to a + 14. 


AAl 55638 


783413 


Preferably excluded from the present 
invention are one or more 
polynucleotides comprising a nucleotide 


H58751, H93683, H93684, N93167, W19186, 
W19958, W3877I,N91367 
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sequence described by the general 
formula of a-b, where a is any integer 
between 1 to 766 of SEQ ID NO: 156, b 
is an integer of 1 5 to 780, where both a 
and b correspond to the positions of 
nucleotide residues shown in SEQ ID 
NO: 1 56, and where b is greater than or 
equal to a + 14. 




784407 


Preferably excluded from the present 
invention are one or more 
polynucleotides comprising a nucleotide 
sequence described by the general 
formula of a-b, where a is any integer 
between I to 1 1 1 3 of SEQ ID NO: 1 57, 
b is an integer of 15 to i 127, where both 
a and b correspond to the positions of 
nucleotide residues shown in SEQ ID 
NO: 1 57, and where b is greater than or 
equal to a + 14. 




784548 


Preferably excluded from the present 
invention are one or more 
polynucleotides comprising a nucleotide 
sequence described by the general 
formula of a-b, where a is any integer 
between 1 to 1268 of SEQ ID NO:l58, 
b is an integer of 15 to 1282, where both 
a and b correspond to the positions of 
nucleotide residues shown in SEQ ID 
NO: 158, and where b is greater than or 
equal to a + 14. 




785075 


Preferably excluded from the present 
invention are one or more 
polynucleotides comprising a nucleotide 
sequence described by the general 
formula of a-b, where a is any integer 
between I to 1491 of SEQ ID NO: 159, 
b is an integer of 15 to 1505, where both 
a and b correspond to the positions of 
nucleotide residues shown in SEQ ID 
MO: 1 59, and where b is greater than or 
equal to a+ 14. 




785677 


Preferably excluded from the present 
invention are one or more 
polynucleotides comprising a nucleotide 
sequence described by the general 
formula of a-b, where a is any integer 
between I to 722 of SEQ ID NO: 160, b 
is an integer of 15 to 736, where both a 
and b correspond to the positions of 
nucleotide residues shown in SEQ ID 
NO: 1 60, and where b is greater than or 
squal to a+ 14. 
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786238 


Preferably excluded from the present 
invention are one or more 
polynucleotides comprising a nucleotide 
sequence described by the general 
formula of a-b. where a is any integer 
between 1 to 98 1 of SEQ ID NO: 16 1, b 
is an integer of 1 5 to 995, where both a 
and b correspond to the positions of 
nucleotide residues shown in SEQ ID 
NO: 161, and where b is greater than or 
equal to a + 14. 




786389 


Preferably excluded from the present 
invention are one or more 
polynucleotides comprising a nucleotide 
sequence described by the general 
formula of a-b, where a is any integer 
between 1 to 1 1 1 1 of SEQ ID NO:l62, 
b is an integer of 15 to 1 125, where both 
a and b correspond to the positions of 
nucleotide residues shown in SEQ ID 
NO: 162, and where b is greater than or 
equal to a + 14. 




786929 


Preferably excluded from the present 
invention are one or more 
polynucleotides comprising a nucleotide 
sequence described by the general 
formula of a-b, where a is any integer 
between 1 to 409 of SEQ ID NO: 163, b 
is an integer of 15 to 423, where both a 
and b correspond to the positions of 
nucleotide residues shown in SEQ ID 
NO: 1 63, and where b is greater than or 
equal to a + 14. 




786932 


Preferably excluded from the present 
invention are one or more 
polynucleotides comprising a nucleotide 
sequence described by the general 
formula of a-b, where a is any integer 
between I to 1628 of SEQ ID NO: 164, 
3 is an integer of 15 to 1642, where both 
a and b correspond to the positions of 
nucleotide residues shown in SEQ ID 
^0: 164, and where b is greater than or 
squal to a + 14. 




787078 

1 
I 


Preferably excluded from the present 
nvention are one or more 
3olynucleotides comprising a nucleotide 
iequence described by the general 
brmula of a-b, where a is any integer 
between 1 to 1 101 of SEQ ID NO: 1 65, 
3 is an integer of 15 to 1115, where both 
i and b correspond to the positions of 
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nucleotide residues shown in SEQ ID 
NO: 165, and where b is greater than or 
equal to a + 14. 




787139 


Preferably excluded from the present 
invention are one or more 
polynucleotides comprising a nucleotide 
sequence described by the general 
formula of a-b, where a is any integer 
between 1 to 1052 of SEQ ID NO:l66, 
b is an integer of 15 to 1066, where both 
a and b correspond to the positions of 
nucleotide residues shown in SEQ ID 
NO: 166, and where b is greater than or 
equal to a + 14. 




787283 


Preferably excluded from the present 
invention are one or more 
polynucleotides comprising a nucleotide 
sequence described by the general 
formula of a-b, where a is any integer 
between 1 to 643 of SEQ ID NO: 167, b 
is an integer of 1 5 to 657, where both a 
and b correspond to the positions of 
nucleotide residues shown in SEQ ID 
NO: 167, and where b is greater than or 
equal to a + 14. 


R22724 

• 


788761 


Preferably excluded from the present 
invention are one or more 
polynucleotides comprising a nucleotide 
sequence described by the general 
formula of a-b, where a is any integer 
between 1 to 1012 of SEQ ID NO:168, 
b is an integer of 15 to 1026, where both 
a and b correspond to the positions of 
nucleotide residues shown in SEQ ID 
NO: 168, and where b is greater than or 
equal to a + 14. 




788988 


Preferably excluded from the present 
invention are one or more 
polynucleotides comprising a nucleotide 
sequence described by the general 
formula of a-b, where a is any integer 
between i to 760 of SEQ ID NO: 169, b 
is an integer of 1 5 to 774, where both a 
and b correspond to the positions of 
nucleotide residues shown in SEQ ID 
W: 169, and where b is greater than or 
equal to a + 14. 




789092 


?*referably excluded from the present 
nvention are one or more 
5oIynucleotides comprising a nucleotide 
sequence described by the general 
brmula of a-b, where a is any integer 


AA234588 



\ 
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between 1 to 388 of SEQ ID NO: 1 70, b 
is an integer of 15 to 402. where both a 
and b correspond to the positions of 
nucleotide residues shown in SEQ ID 
NO; 170, and wliere b is greater than or 
equal to a + 14. 




789298 


Preferably excluded from the present 
invention are one or more 
polynucleotides comprising a nucleotide 
sequence described by the general 
formula of a-b, where a is any integer 
between 1 to 782 of SEQ ID NO: 1 7 L b 
is an integer of 15 to 796, where both a 
and b correspond to the positions of 
nucleotide residues shown in SEQ ID 
NO: 171, and where b is greater than or 
equal to a + 14. 




789299 


Preferably excluded from the present 
invention are one or more 
polynucleotides comprising a nucleotide 
sequence described by the general 
formula of a-b, where a is any integer 
between 1 to 464 of SEQ ID NO: 1 72, b 
is an integer of 1 5 to 478, where both a 
and b correspond to the positions of 
nucleotide residues shown in SEQ ID 
NO: 1 72, and where b is greater than or 
equal to a + 14. 




789718 


Preferably excluded from the present 
invention are one or more 
polynucleotides comprising a nucleotide 
sequence described by the general 
formula of a-b, where a is any integer 
between 1 to 642 of SEQ ID NO: 1 73, b 
is an integer of 15 to 656, where both a . 
and b correspond to the positions of 
nucleotide residues shown in SEQ ID 
MO: 173, and where b is greater than or 
equal to a + 14. 




789957 


Preferably excluded from the present 
invention are one or more 
3olynucleotides comprising a nucleotide 
sequence described by the general 
formula of a-b, where a is any integer 
between 1 to 1877 of SEQ ID NO: 174, 
3 is an integer of 15 to 1891, where both 
a and b correspond to the positions of 
nucleotide residues shown in SEQ ID 
"^0: 1 74, and where b is greater than or 
2qual to a + 14. 


T5I260, T6194I, T62167, T77034, T90753, 
R38108, N32708, N92379, W24621, W42543, 
W42478, AA 128007, AA 128031, AA 134234,' 
AA424998 


789977 


Preferably excluded from the present 
nvention are one or more 


T56442, T78292, R37940, R56008, R56009, 
R56573, R56574, Hi 1080, N34431, N48665. 
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polynucleotides comprising a nucleotide 
sequence described by the general 
formula of a-b, where a is any integer 
between 1 to 2147 ofSEQ IDNOiHS. 
b is an integer of 15 to 2161, where both 
a and b correspond to the positions of 
nucleotide residues shown in SEQ ID 
NO: 1 75. and where b is greater than or 
equal to a + 14. 


AA010749, AAOl 1 177, AA070806, AA070882, 
AA146859. AA147636, AAI47691, AA164223, 
A A 1 64224, AA2 1 0729, A A2 1 0859, AA243063, 
AA243070. AA464493, AA464494 


790285 


Preferably excluded from the present 
inventinn firp nnp fir more 
polynucleotides comprising a nucleotide 
sequence described by the general 
formula of a-b, where a is any integer 
between I to 2397 of SEQ ID NO:! 76, 
b is an integer of 15 to 24 11, where both 
a and b correspond to the positions of 
nucleotide residues shown in SEQ ID 
NO: 176, and where b is greater than or 
equal to a + 14. 


T66279, T66328, T84I64, T85098, R24232, 
R24233 H03657 H03658 H98526 HQ8S^6 
H99618. N22728, N29400, N32172, N33953, 
N41460, N69471, N70552, N73722, W03893, 
VV44579. W72407, W76486, W78102, W79410, 
N90963, AA0448I6, AA044841, AA086039, 
AA086121. AA088877, AA 102298, AA 130887, 
AAI31529. AAI31603, AA181784. AA182515, 
AAI90450. AA191392, AA223757 


790509 


Preferably excluded from the present 
invention are one or more 
polynucleotides comprising a nucleotide 
sequence described by the general 
formula of a-b, where a is any integer 
between 1 to 1324 of SEQ ID NO: 1 77, 
b is an integer of 15 to 1338, where both 
a and b correspond to the positions of 
nucleotide residues shown in SEQ ID 
NO: 1 77, and where b is greater than or 
equal to a + 14. 


T68040. H17760, AA101036, AA129837 


790775 


Preferably excluded from the present 
invention are one or more 
polynucleotides comprising a nucleotide 
sequence described by the general 
formula of a-b, where a is any integer 
between I to 1600 of SEQ ID NO: 178, 
b is an integer of 15 to 1614, where both 
a and b correspond to the positions of 
nucleotide residues shown in SEQ ID 
NO: 1 78, and where b is greater than or 
equal to a + 14. 


N25320, N31 432, W8 1044, W81097 


790888 


Preferably excluded from the present 
invention are one or more 
polynucleotides comprising a nucleotide 
sequence described by the general 
formula of a-b, where a is any integer 
between 1 to 4278 of SEQ ID NO: 179, 
b is an integer of 15 to 4292, where both 
d and b correspond to the positions of 
nucleotide residues shown in SEQ ID 
vIO: 179, and where b is greater than or 


R 14550 R 15204 T26493 R2I597 R22908 
R23010, R4I21 1, R41649, R43371, R4121 1, 
R41649. R43371, R58989, R59048, H05739, 
H05845, H17266, H17265. H23579. H44104, 
H46505, H47043, H58955, H59002, H73676, 
H73730. H80078, H82275, H82289, H82399, 
H82381, H97810, H98I33, H98737, N231 17, 
N243 1 0. N25 196, N25265, N27792, N28735, 
N29893, N33395, N33904, N36066, N36839, 
N42542. N46060, N51230, N59535, N67737. 
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equal to a + 14. 


N7364L N78481. N78694, W03555, W 15202, 
W52445. W52723, W95124. AA047257, 
AA057142, AA204699. AA251464, AA430598 


791506 


Preferably excluded from the present 
invention are one or more 
polynucleotides comprising a nucleotide 
sequence described by the general 
formula of a-b, where a is any integer 
between 1 to 229 of SEQ ID NO: 1 80, b 
is an integer of 15 to 243, where both a 
and b correspond to the positions of 
nucleotide residues shown in SEQ ID 
NO: 180, and where b is greater than or 
equal to a + 14. 




791649 


Preferably excluded from the present 
invention are one or more 
polynucleotides comprising a nucleotide 
sequence described by the general 
formula of a-b, where a is any integer 
between I to 799 of SEQ ID NO: 1 8 1 , b 
is an integer of 1 5 to 8 1 3, where both a 
and b correspond to the positions of 
nucleotide residues shown in SEQ ID 
NO: 18 1 , and where b is greater than or 
equal to a -H 14. 




791802 


Preferably excluded from the present 
invention are one or more 
polynucleotides comprising a nucleotide 
sequence described by the general 
formula of a-b, where a is any integer 
between I to 808 of SEQ ID NO: 1 82, b 
is an integer of 15 to 822, where both a 
and b correspond to the positions of 
nucleotide residues shown in SEQ ID 
NO: 1 82, and where b is greater than or 
equal to a + 14. 




792002 


Preferably excluded from the present 
invention are one or more 
polynucleotides comprising a nucleotide 
sequence described by the general 
formula of a-b, where a is any integer 
between 1 to 1 08 1 of SEQ ID NO: 1 83, 
b is an integer of 15 to 1095, where both 
a and b correspond to the positions of 
nucleotide residues shown in SEQ ID 
MO: 183, and where b is greater than or 
equal to a + 14. 


T49735, T49736, T95310, T9539I, T99384, 
T99612, R63493, R63494, H27739, R91698, 
R92136, H52608, H57619, H58464, H61415, 
H62139, H69019, H87167, H87669, N21358, 
N70307, N79596, W19063, W58498, W58651, 
W79687, W81289, AA099849, AA099972, 
AA232767 


792291 


Preferably excluded from the present 
invention are one or more 
polynucleotides comprising a nucleotide 
sequence described by the general 
formula of a-b, where a is any integer 


T55436, R21797. R22403. R22452, R229I6, 
R23020, R76901, R77068,H22573,H25752, 
H25866, R83900, H50717, H50821, H64026, 
H64791, H95702, N64545. N69769, N74704, 
N80341, W05092, W79489, W79634, 
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between I to 3661 of SEQ ID NO: 184, 
b is an integer of 15 to 3675. where both 
a and b correspond to the positions of 
nucleotide residues shown in SEQ ID 
NO: 184. and where b is greater than or 
equal to a + 14. 


AA005055, AA005007. AA025043, AA03671 1, 
AA037127, AA043916. AA055100, AA063627, 
AA069I42, AA069230. AA069323, AA069376, 
AAl 12277, AAI 12531. AAl 15279, AA151238, 
AA 1 5 1 239, AA 1 5 1 582, AA 1 49398, AA 1 4996 1 , 
AA 1 50069, AA 1 58029. AA 1 5832 1 , AA 1 58692, 
AA158693, AA161232, AA236787, AA236834, 
AA256776. AA261961 


792371 


Preferably excluded from the present 
invention are one or more 
polynucleotides comprising a nucleotide 
sequence described by the general 
formula of a-b, where a is any integer 
between 1 to 1026 of SEQ ID NO: 185, 
b is an integer of 15 to 1040, where both 
a and b correspond to the positions of 
nucleotide residues shown in SEQ ID 
NO; 185, and where b is greater than or 
equal to a + 14. 




792660 


Preferably excluded from the present 
invention are one or more 
polynucleotides comprising a nucleotide 
sequence described by the general 
formula of a-b, where a is any integer 
between 1 to 803 of SEQ ID NO: 186, b 
is an integer of 1 5 to 8 1 7, where both a 
and b correspond to the positions of 
nucleotide residues shown in SEQ ID 
NO: 186, and where b is greater than or 
equal to a + 14. 


T59054, T86590, T8327K R48677, R53483, 
R53482, R62329, R62330, R66651, R67372, 
R69095, R69210, R71144, R82632, R82676, 
HI5764, H15765, H195I8. H19605, H27898, 
H42872, H42936, H49329, H49330, H50062, 
H50061, H87268, H87324, H96667, N22675, 
N92574, W37223, W37563, W38866, W61 1 19, 
W65380, AA035095, AA035635, AA037254, 
AA054951, AA062973, AA082301, AAI32472 


792782 


Preferably excluded from the present 
invention are one or more 
polynucleotides comprising a nucleotide 
sequence described by the general 
formula of a-b, where a is any integer 
between I to 1066 of SEQ ID NO: 1 87, 
b is an integer of 15 to 1080, where both 
a and b correspond to the positions of 
nucleotide residues shown in SEQ ID 
NO: 187, and where b is greater than or 
equal to a+ 14. 




792890 


Preferably excluded from the present 
invention are one or more 
polynucleotides comprising a nucleotide 
sequence described by the general 
formula of a-b, where a is any integer 
between I to 1272 of SEQ ID NO: 188, 
b is an integer of 1 5 to 1 286, where both 
a and b correspond to the positions of 
nucleotide residues shown in SEQ ID 
NO: 188, and where b is greater than or 
equal to a + 14. 


AA251351 
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792931 


Preferably excluded from the present 
invention are one or more 
polynucleotides comprising a nucleotide 
sequence described by the general 
formula of a-b, where a is any integer 
between 1 to 1724 of SEQ ID NO:l89. 
b is an integer of 15 to 1 738, where both 
a and b correspond to the positions of 
nucleotide residues shown in SEQ ID 
NO: 189, and where b is greater than or 
equal to a + 14. 




792943 


Preferably excluded from the present 
invention are one or more 
polynucleotides comprising a nucleotide 
sequence described by the general 
formula of a-b, where a is any integer 
between 1 to 1909 of SEQ IDNO:190, 
b is an integer of 15 to 1923, where both 
a and b correspond to the positions of 
nucleotide residues shown in SEQ ID 
NO: 190, and where b is greater than or 
equal to a + 14. 




793104 


Preferably excluded from the present 
invention are one or more 
polynucleotides comprising a nucleotide 
sequence described by the general 
formula of a-b, where a is any integer 
between 1 to 236 of SEQ IDN0:191,b 
is an integer of 15 to 250, where both a 
and b correspond to the positions of 
nucleotide residues shown in SEQ ID 
NO: 191, and where b is greater than or 
equal to a+ 14. 




793445 


Preferably excluded from the present 
invention are one or more 
polynucleotides comprising a nucleotide 
sequence described by the general 
formula of a-b, where a is any integer 
between 1 to 1888 of SEQ ID NO: 192, 
b is an integer of 15 to 1902, where both 
a and b correspond to the positions of 
nucleotide residues shown in SEQ ID 
NO: 192, and where b is greater than or 
equal to a + 14. 


AA034998, AA044249, AA08883O, AA429418 


793446 


Preferably excluded from the present 
invention are one or more 
polynucleotides comprising a nucleotide 
sequence described by the general 
formula of a-b, where a is any integer 
between I to 546 of SEQ ID NO: 193, b 
is an integer of 15 to 560, where both a 
and b correspond to the positions of 


157765. T60664, HO 1264, H45774, H54790, 
H54842, H64484, H64485, N98810, W58332, 
W58653, W74582, W79320, W79420, W79565, 
W92452, AA027210, AA027209, AA029725, 
AA029663, AA088693, AA 12 1506, AA 127731, 
AA428362 
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nucleotide residues shown in SEQ ID 
NO: 193, and where b is greater than or 
equal to a + 14, 




793639 


Preferably excluded from the present 
invention are one or more 
polynucleotides comprising a nucleotide 
sequence described by the general 
formula of a-b, where a is any integer 
between 1 to 576 of SEQ ID NO: 194. b 
is an integer of 15 to 590, where both a 
and b correspond to the positions of 
nucleotide residues shown in SEQ ID 
NO: 194, and where b is greater than or 
equal to a + 14. 


N6988K N93023, N98853, W21375, W73944, 
W77988, AAl6953a AA169837, AA176453, 
AA 176931 


794213 


Preferably excluded from the present 
invention are one or more 
polynucleotides comprising a nucleotide 
sequence described by the general 
formula of a-b, where a is any integer 
between 1 to 677 of SEQ ID NO: 195, b 
is an integer of 1 5 to 69 1 , where both a 
and b correspond to the positions of 
nucleotide residues shown in SEQ ID 
NO: 195, and where b is greater than or 
equal to a+ 14. 


N53897,N553I8 


795858 


Preferably excluded from the present 
invention are one or more 
polynucleotides comprising a nucleotide 
sequence described by the general 
formula of a-b, where a is any integer 
between 1 to 1758 of SEQ ID NO: 196, 
b is an integer of 15 to 1772, where both 
a and b correspond to the positions of 
nucleotide residues shown in SEQ ID 
NO: 196, and where b is greater than or 
equal to a + 14. 




795955 


Preferably excluded from the present 
invention are one or more 
polynucleotides comprising a nucleotide 
sequence described by the general 
formula of a-b, where a is any integer 
between 1 to 661 of SEQ ID NO: 197, b 
is an integer of 15 to 675, where both a 
and b correspond to the positions of 
nucleotide residues shown in SEQ ID 
NO: 197, and where b is greater than or 
equal to a + 14. 




796359 


Preferably excluded from the present 
invention are one or more 
polynucleotides comprising a nucleotide, 
sequence described by the general 
formula of a-b, where a is any integer 
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between 1 to 543 of SEQ ID NO: 198, b 
is an integer of 15 to 557. where both a 
and b con'espond to the positions of 
nucleotide residues shown in SEQ ID 
NO: 198, and where b is greater than or 
equal to a + 14. 




796555 


Preferably excluded from the present 
invention are one or more 
polynucleotides comprising a nucleotide 
sequence described by the general 
formula of a-b, where a is any integer 
between 1 to 2597 of SEQ ID NO: 199, 
b is an integer of 15 to 261 1, where both 
a and b correspond to the positions of 
nucleotide residues shown in SEQ ID 
NO: 199, and where b is greater than or 
equal to a + 14. 


T69136, T69194, T95612, T95713, R53091, 
R73 1 26, N4 1 876, N49 1 74, W05348, W04725, 
W31397, W31827, W92674, AA039513 


796675 


Preferably excluded from the present 
invention are one or more 
polynucleotides comprising a nucleotide 
sequence described by the general 
formula of a-b, where a is any integer 
between 1 to 2302 of SEQ ID NO:200, 
b is an integer of 15 to 2316, where both 
a and b correspond to the positions of 
nucleotide residues shown in SEQ ID 
NO:200, and where b is greater than or 
equal to a+ 14. 




796743 


Preferably excluded from the present 
invention are one or more 
polynucleotides comprising a nucleotide 
sequence described by the general 
formula of a-b, where a is any integer 
between 1 to 1 133 of SEQ ID NO:201, 
b is an integer of 1 5 to 1 147, where both 
a and b correspond to the positions of 
nucleotide residues shown in SEQ ID 
[^0:20 1, and where b is greater than or 
equal to a+ 14. 




796792 


Preferably excluded from the present 
invention are one or more 
polynucleotides comprising a nucleotide 
sequence described by the general 
]'ormula of a-b, where a is any integer 
between 1 to 674 of SEQ ID NO:202, b 
is an integer of 1 5 to 688, where both a 
and b correspond to the positions of 
nucleotide residues shown in SEQ ID 
NO:202, and where b is greater than or 
equal to a + 14. 




799668 


Preferably excluded from the present 
invention are one or more 
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polynucleotides comprising a nucleotide 
sequence described by the general 
formula of a-b, where a is any integer 
between I to 290 of SEQ ID NO:203, b 
is an integer of 15 to 304, where both a 
and b con*espond to the positions of 
nucleotide residues shown in SEQ ID 
NO:203, and where b is greater than or 
equal to a + 14. 




799669 


Preferably excluded from the present 
invention are one or more 
polynucleotides comprising a nucleotide 
sequence described by the general 
formula of a-b, where a is any integer 
between 1 to 403 of SEQ ID NO:204, b 
is an integer of 15 to 417, where both a 
and b correspond to the positions of 
nucleotide residues shown in SEQ ID 
NO:204, and where b is greater than or 
equal to a + 14. 




799673 


Preferably excluded from the present 
invention are one or more 
polynucleotides comprising a nucleotide 
sequence described by the general 
formula of a-b, where a is any integer 
between 1 to 537 of SEQ ID NO:2b5, b 
is an integer of 15 to 55 1, where both a 
and b correspond to the positions of 
nucleotide residues shown in SEQ ID 
NO:205, and where b is greater than or 
equal to a+ 14. 




799674 


Preferably excluded from the present 
invention are one or more 
polynucleotides comprising a nucleotide 
sequence described by the general . 
formula of a-b, where a is any integer 
between 1 to 1087 of SEQ ID NO:206, 
b is an integer of 1 5 to 1101, where both 
a and b correspond to the positions of 
nucleotide residues shown in SEQ ID 
KO:206, and where b is greater than or 
equal to a + 14. 




799678 


Preferably excluded from the present 
invention are one or more 
polynucleotides comprising a nucleotide 
sequence described by the general 
formula of a-b, where a is any integer 
between 1 to 501 of SEQ 1DNO:207, b 
is an integer of 15 to 515, where both a 
and b correspond to the positions of 
nucleotide residues shown in SEQ ID 
MO:207, and where b is greater than or 
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equal to a + 14. 




799728 


Preferably excluded from the present 
invention are one or more 
polynucleotides comprising a nucleotide 
sequence described by the general 
formula of a-b, where a is any integer 
between 1 to 255 of SEQ ID NO:208, b 
is an integer of 15 to 269. where both a 
and b correspond to the positions of 
nucleotide residues shown in SEQ ID 
NO:208, and where b is greater than or 
equal to a + 14. 




799748 


Preferably excluded from the present 
invention are one or more 
polynucleotides comprising a nucleotide 
sequence described by the general 
formula of a-b, where a is any integer 
between I to 720 of SEQ ID NO:209, b 
is an integer of 1 5 to 734. where both a 
and b correspond to the positions of 
nucleotide residues shown in SEQ ID 
NO:209, and where b is greater than or 
equal to a + 14. 


H19497, H19579, H50n7, H50164, H52826, 
H52827, H6 1 1 84, H62087, H96290, H9629 1 , 
N20586, N21261, N28978, N30137, N30490, 
N35750, W31933, W37535, N90542, 
AA418545, AA418511 


799760 


Preferably excluded from the present 
invention are one or more 
polynucleotides comprising a nucleotide 
sequence described by the general 
formula of a-b, where a is any integer 
between I to 644 of SEQ ID NO:210, b 
is an integer of 15 to 658, where both a 
and b correspond to the positions of 
nucleotide residues shown in SEQ ID 
NO:210, and where b is greater than or 
equal to a + 14. 




799805 


Preferably excluded from the present 
invention are one or more 
polynucleotides comprising a nucleotide 
sequence described by the general 
formula of a-b, where a is any integer 
between 1 to 190 of SEQ IDN0;21Lb 
is an integer of 1 5 to 204, where both a 
and b correspond to the positions of 
nucleotide residues shown in SEQ ID 
[^0:21 1, and where b is greater than or 
equal to a + 14. 




800296 


Preferably excluded from the present 
invention are one or more 
polynucleotides comprising a nucleotide 
sequence described by the general 
formula of a-b, where a is any integer 
between 1 to 1257 of SEQ ID N0:2 12, 
D is an integer of 1 5 to 1 27 1 , where both 
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a and b correspond to the positions of 
nucleotide residues shown in SEQ ID 
NO:212, and where b is greater than or 
equal to a + 14. 




800327 


Preferably excluded from the present 
invention are one or more 
polynucleotides comprising a nucleotide 
sequence described by the general 
formula of a-b, where a is any integer 
between 1 to 101 1 of SEQ IDNO:213, 
b is an integer of 15 to 1025, where both 
a and b correspond to the positions of 
nucleotide residues shown in SEQ ID 
NO;213, and where b is greater than or 
equal to a + 14. 




800816 


Preferably excluded from the present 
invention are one or more 
polynucleotides comprising a nucleotide 
sequence described by the general 
formula of a-b, where a is any integer 
between 1 to 337 of SEQ 1DN0:2I4, b 
is an integer of 15 to 351, where both a 
and b correspond to the positions of 
nucleotide residues shown in SEQ ID 
NO:214, and where b is greater than or 
equal to a + 14. 


• 


800835 


Preferably excluded from the present 
invention are one or more 
polynucleotides comprising a nucleotide 
sequence described by the general 
formula of a-b, where a is any integer 
between 1 to 1073 of SEQ IDNO:215, 
b is an integer of 15 to 1087, where both 
a and b correspond to the positions of 
nucleotide residues shown in SEQ ID 
NO:215, and where b is greater than or 
equal to a+ 14. 




805429 


J'referably excluded from the present 
invention are one or more 
polynucleotides comprising a nucleotide 
sequence described by the general 
formula of a-b, where a is any integer 
between I to 1963 of SEQ ID NO:216, 
b is an integer of 1 5 to 1977, where both 
a and b correspond to the positions of 
nucleotide residues shown in SEQ ID 
N0:2 16, and where b is greater than or 
equal to a + 14. 




805458 


Preferably excluded from the present 
invention are one or more 
polynucleotides comprising a nucleotide 
sequence described by the general 


T82438, T82439, R19121, R20391, R28602, 
R36743, R43508, R46035, R43508, R46035, 
R79588, H24625, N28372, N28785, N29421, 
N35476, N57353, N72836, N79096, W03034, 
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formula of a-b, where a is any integer 
between 1 to 2801 ofSEQ IDNO:217, 
b is an integer of 1 5 to 28 1 5, where both 
a and b correspond to the positions of 
nucleotide residues shown in SEQ ID 
N0:2I7, and where b is greater than or 
equal to a + 14. 


AA016073. AA019733, AA021030, AA062895, 
AA08 1 968, AA 1 1 5692, AA 1 335 1 1 , AA 1 5 1 852, 
AA 149707, AA 194903, AA 194902 


805478 


Preferably excluded from the present 
invention are one or more 
polynucleotides comprising a nucleotide 
sequence described by the general 
formula of a-b, where a is any integer 
between 1 to 1631 of SEQ IDNO:218, 
b is an integer of 15 to 1645, where both 
a and b correspond to the positions of 
nucleotide residues shown in SEQ ID 
NO:2I8, and where b is greater than or 
equal to a+ 14. 




805805 


Preferably excluded from the present 
invention are one or more 
polynucleotides comprising a nucleotide 
sequence described by the general 
formula of a-b, where a is any integer 
between 1 to 464 of SEQ ID NO:219, b 
is an integer of 1 5 to 478, where both a 
and b correspond to the positions of 
nucleotide residues shown in SEQ ID 
NO:219, and where b is greater than or 
equal to a + 14. 




806486 


Preferably excluded from the present 
invention are one or more 
polynucleotides comprising a nucleotide 
sequence described by the general 
formula of a-b, where a is any integer 
between 1 to 8 18 of SEQ ID NO; 220, b 
is an integer of 15 to 832, where both a 
and b correspond to the positions of 
nucleotide residues shown in SEQ ID 
NO: 220, and where b is greater than or 
equal to a+ 14. 




806498 


Preferably excluded from the present 
invention are one or more 
polynucleotides comprising a nucleotide 
sequence described by the general 
formula of a-b, where a is any integer 
between 1 to 1878 of SEQ ID NO:221, 
b is an integer of 1 5 to 1 892, where both 
a and b correspond to the positions of 
nucleotide residues shown in SEQ ID 
NO:221 , and where b is greater than or 
equal to a+ 14. 




806819 


Preferably excluded from the present 
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invention are one or more 
polynucleotides comprising a nucleotide 
sequence described by the general 
formula of a-b, where a is any integer 
between 1 to 854 of SEQ ID NO:222. b 
is an integer of 15 to 868. where both a 
and b correspond to the positions of 
nucleotide residues shown in SEQ ID 
NO:222, and where b is greater than or 
equal to a + 14. 




810870 


Preferably excluded from the present 
invention are one or more 
polynucleotides comprising a nucleotide 
sequence described by the general 
formula of a-b, where a is any integer 
between 1 to 1502 of SEQ IDNO:223, 
b is an integer of 1 5 to 1516. where both 
a and b correspond to the positions of 
nucleotide residues shown in SEQ ID 
NO:223, and where b is greater than or 
equal to a + 14. 


R50267, R50730, H27672, H27673, H30138, 
H99256, N74342, N80868, W05054, W07601 


811730 


Preferably excluded from the present 
invention are one or more 
polynucleotides comprising a nucleotide 
sequence described by the general 
formula of a-b, where a is any integer 
between 1 to 1292 of SEQ ID NO:224, 
b is an integer of 15 to 1306, where both 
a and b correspond to the positions of 
nucleotide residues shown in SEQ ID 
NO:224, and where b is greater than or 
equal to a + 14. 




813025 


Preferably excluded from the present 
invention are one or more 
polynucleotides comprising a nucleotide 
sequence described by the general 
formula of a-b, where a is any integer 
between I to 570 of SEQ ID NO:225, b 
is an integer of 1 5 to 584, where both a 
and b correspond to the positions of 
hucleotide residues shown in SEQ ID 
NO:225, and where b is greater than or 
equal to a + 14. 




813233 


Preferably excluded from the present 
invention are one or more 
polynucleotides comprising a nucleotide 
sequence described by the general 
formula of a-b, where a is any integer 
between I to 509 of SEQ ID NO:226, b 
is an integer of 1 5 to 523, where both a 
and b correspond to the positions of 
nucleotide residues shown in SEQ ID 
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NO:226, and where b is greater than or 
equal to a + 14. 




813262 


Preferably excluded from the present 
invention are one or more 
polynucleotides comprising a nucleotide 
sequence described by the general 
formula of a-b, where a is any integer 
between 1 to 2363 of SEQ ID NO:227, 
b is an integer of 15 to 2377, where both 
a and b correspond to the positions of 
nucleotide residues shown in SEQ ID 
NO:227, and where b is greater than or 
equal to a+ 14. 




815637 


Preferably excluded from the present 
invention are one or more 
polynucleotides comprising a nucleotide 
sequence described by the general 
formula of a-b, where a is any integer 
between 1 to 449 of SEQ ID NO:228, b 
is an integer of 15 to 463, where both a 
and b correspond to the positions of 
nucleotide residues shown in SEQ ID 
NO:228, and where b is greater than or 
equal to a+ 14. 




815853 


Preferably excluded from the present 
invention are one or more 
polynucleotides comprising a nucleotide 
sequence described by the general 
formula of a-b, where a is any integer 
between 1 to 1218 of SEQ ID NO:229, 
b is an integer of 15 to 1 232, where both 
a and b correspond to the positions of 
nucleotide residues shown in SEQ ID 
NO;229, and where b is greater than or 
equal to a + 14. 


R53293, R59708, R59818, R88929, R89609, 
H78819, N52I82, AA125808, AA128281 


815999 


Preferably excluded from the present 
invention are one or more 
polynucleotides comprising a nucleotide 
sequence described by the general 
formula of a-b, where a is any integer 
between 1 to 1049 of SEQ ID NO:230, 
b is an integer of 15 to 1063, where both 
a and b correspond to the positions of 
nucleotide residues shown in SEQ ID 
SfO:230, and where b is greater than or 
equal to a + 14. 




823427 


Preferably excluded from the present 
invention are one or more 
3olynucleotides comprising a nucleotide 
sequence described by the general 
formula of a-b, where a is any integer 
between I to 1049 of SEQ IDNO:23U 


T53986, T60846, T72425, R 18752, H22479, 
H502 1 1 , N408 1 7, N9343 1 , W2 1474, W2 1 308, 
W32281, W44860, W95821, N90881, 
A A 1 3 203 7, AA 1 3 1 965, A A 1 5 11 57, AA 1 5 5868, 
AA 1 56600, AA 1 56837, AA 1 5706 1 , AA 1 57045, 
AA 160623, AA 169460, AA 176447, AA 1 78894, 
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b is an integer of 15 to 1063, where both 
a and b correspond to the positions of 
nucleotide residues shown in SEQ ID 
NO:23 1, and where b is greater than or 
equal to a + 14. 


A A 1 79764, AA 1 80438, AA 1 8 1 145, AAl 8 1 144, 
AA 196382. AA196478 


823704 


Preferably excluded from the present 
invention are one or more 
polynucleotides comprising a nucleotide 
sequence described by the general 
formula of a-b. where a is any integer 
between I to 1460 of SEQ IDNO:232, 
b is an integer of 15 to 1474, where both 
a and b correspond to the positions of 
nucleotide residues shown in SEQ ID 
NO:232, and where b is greater than or 
equal to a + 14. 




824798 


Preferably excluded from the present 
invention are one or more 
polynucleotides comprising a nucleotide 
sequence described by the general 
formula of a-b, where a is any integer 
between 1 to 1768 of SEQ IDNO:233, 
b is an integer of 15 to 1782, where both 
a and b correspond to the positions of 
nucleotide residues shown in SEQ ID 
NO:233, and where b is greater than or 
equal to a + 14. 




825018 


Preferably excluded from the present 
invention are one or more 
polynucleotides comprising a nucleotide 
sequence described by the general 
formula of a-b, where a is any integer 
i^etween 1 to 2194 of SEQ ID NO:234, 
b is an integer of 1 5 to 2208, where both 
a and b correspond to the positions of 
nucleotide residues shown in SEQ ID 
NO: 234, and where b is greater than or 
equal to a + 14. 




825076 


Preferably excluded from the present 
invention are one or more 
polynucleotides comprising a nucleotide 
sequence described by the general 
formula of a-b, where a is any integer 
between 1 to 2566 of SEQ ID NO:235, 
3 is an integer of 1 5 to 2580, where both 
a and b correspond to the positions of 
nucleotide residues shown in SEQ ID 
SO:235, and where b is greater than or 
equal to a + 14. 




825787 


Preferably excluded from the present 
mvention are one or more 
polynucleotides comprising a nucleotide 
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sequence described by the general 
formula of a-b, where a is any integer 
between 1 to 2994 of SEQ ID NO:236, 
b is an integer of 15 to 3008, where both 
a and b con'espond to the positions of 
nucleotide residues shown in SEQ ID 
NO: 23 6, and where b is greater than or 
equal to a + 14. 




826116 


Preferably excluded from the present 
invention are one or more 
polynucleotides comprising a nucleotide 
sequence described by the general 
formula of a-b, where a is any integer 
between 1 to 863 of SEQ ID NO:237, b 
is an integer of 1 5 to 877, where both a 
and b correspond to the positions of 
nucleotide residues shown in SEQ ID 
KO:237, and where b is greater than or 
equal to a + 14. 




826147 


Preferably excluded from the present 
invention are one or more 
polynucleotides comprising a nucleotide 
sequence described by the general 
formula of a-b, where a is any integer 
between 1 to 3025 of SEQ ID NO:238, 
b is an integer of 15 to 3039, where both 
a and b correspond to the positions of 
nucleotide residues shown in SEQ ID 
NO:238, and where b is greater than or 
equal to a + 14. 




827020 


Preferably excluded from the present 
invention are one or more 
polynucleotides comprising a nucleotide 
sequence described by the general 
formula of a-b, where a is any integer 
between i to 1978 of SEQ ID NO:239, 
b is an integer of 15 to 1992, where both 
a and b correspond to the positions of 
nucleotide residues shown in SEQ ID 
t^O:239, and where b is greater than or 
equal to a + 14. 




827586 


Preferably excluded from the present 
invention are one or more 
polynucleotides comprising a nucleotide 
sequence described by the general 
formula of a-b, where a is any integer 
between 1 to 483 of SEQ ID NO:240, b 
IS an integer of 1 5 to 497, where both a 
and b correspond to the positions of 
nucleotide residues shown in SEQ ID 
MO:240, and where b is greater than or 
squal to a + 14. 
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827732 


Preferably excluded from the present 
invention are one or more 
polynucleotides comprising a nucleotide 
sequence described by the general 
formula of a-b, where a is any integer 
between 1 to 302 of SEQ ID NO:24L b 
is an integer of 1 5 to 3 16, where both a 
and b correspond to the positions of 
nucleotide residues shown in SEQ ID 
NO:241 , and where b is greater than or 
equal to a + 14. 




827735 


Preferably excluded from the present 
invention are one or more 
polynucleotides comprising a nucleotide 
sequence described by the general 
formula of a-b, where a is any integer 
between I to 815 of SEQ ID NO:242, b 
is an integer of 15 to 829. where both a 
and b correspond to the positions of 
nucleotide residues shown in SEQ ID 
NO:242, and where b is greater than or 
equal to a + 14. 




827740 


Preferably excluded from the present 
invention are one or more 
polynucleotides comprising a nucleotide 
sequence described by the general 
formula of a-b, where a is any integer 
between 1 to 824 of SEQ ID NO:243, b 
is an integer of 15 to 838, where both a 
and b correspond to the positions of 
nucleotide residues shown in SEQ ID 
NO: 243, and where b is greater than or 
equal to a + 14. 


R21513, R22316, R42033, R43706, R42033, 
R43706, R63113, R70954, R71006, N48618, 
N53377',AA9 12400 


827808 


Preferably excluded from the present 
invention are one or more 
polynucleotides comprising a nucleotide 
sequence described by the general 
formula of a-b, where a is any integer 
between 1 to 2839 of SEQ IDNO:244, 
b is an integer of 15 to 2853, where both 
a and b correspond to the positions of 
nucleotide residues shown in SEQ ID 
NO:244, and where b is greater than or 
equal to a + 14. 




828251 


Preferably excluded from the present 
mvention are one or more 
polynucleotides comprising a nucleotide 
sequence described by the general 
formula of a-b, where a is any integer 
between 1 to 1 183 of SEQ ID NO:245, 
b is an integer of 15 to 1 197, where both 
a and b correspond to the positions of 
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nucleotide residues shown in SEQ ID 
NO:245, and where b is greater than or 
equal to a + 14. 




828357 


Preferably excluded from the present 
invention are one or more 
polynucleotides comprising a nucleotide 
sequence described by the general 
formula of a-b, where a is any integer 
between 1 to 834 of SEQ ID NO:246, b 
is an integer of 15 to 848, where both a 
and b correspond to the positions of 
nucleotide residues shown in SEQ ID 
NO:246, and where b is greater than or 
equal to a + 14. 




828449 


Preferably excluded from the present 
invention are one or more 
polynucleotides comprising a nucleotide 
sequence described by the general 
formula of a-b, where a is any integer 
between 1 to 1322 of SEQ ID NO:247, 
b is an integer of 1 5 to 1336, where both 
a and b correspond to the positions of 
nucleotide residues shown in SEQ ID 
NO:247, and where b is greater than or 
equal to a + 14. 




828612 


Preferably excluded from the present 
invention are one or more 
polynucleotides comprising a nucleotide 
sequence described by the general 
formula of a-b, where a is any integer 
between 1 to 1062 of SEQ ID NO:248, 
b is an integer of 15 to 1076, where both 
a and b correspond to the positions of 
nucleotide residues shown in SEQ ID 
NO:248, and where b is greater than or 
equal to a+ 14. 


R285I3, R28661, R3 1336, R41867, R41867, 
R60004 HI 9945 HI 9946 H2206I H46271 
H46342, H826I9, H82618, N20678, W96169, 
AA010842, AA278855. AA582295, AA583721, 
AA639735, AA579409, AA568321, AA833752, 
AA907437, AI054389, W22584 


828647 


Preferably excluded from the present 
invention are one or more 
polynucleotides comprising a nucleotide 
sequence described by the general 
formula of a-b, where a is any integer 
between I to 241 1 of SEQ ID NO:249, 
b is an integer of 15 to 2425, where both 
a and b correspond to the positions of 
nucleotide residues shown in SEQ ID 
W:249, and where b is greater than or 
squal to a + 14. 




828698 


Preferably excluded from the present 
nvention are one or more 
polynucleotides comprising a nucleotide 
sequence described by the general 
brmula of a-b, where a is any integer | 
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between 1 to 1394 of SEQ ID NO:250, 
b is an integer of 1 5 to 1408, where both 
a and b correspond to the positions of 
nucleotide residues shown in SEQ ID 
NO:250, and where b is greater than or 
equal to a + 14. 




828962 


Preferably excluded from the present 

invf^ntinn art* rtnp or morp 

polynucleotides comprising a nucleotide 
sequence described by the general 
formula of a-b, where a is any integer 
between 1 to 480 of SEQ ID NO:25 1 , b 
is an integer of 1 5 to 494, where both a 
and b correspond to the positions of 
nucleotide residues shown in SEQ ID 
NO:251, and where b is greater than or 
equal to a + 14. 




828982 


Preferably excluded from the present 
invention are one or more 
polynucleotides comprising a nucleotide 
sequence described by the general 
formula of a-b, where a is any integer 
between I to 2477 of SEQ IDNO:252, 
b is an integer of 1 5 to 249 1 , where both 
a and b correspond to the positions of 
nucleotide residues shown in SEQ ID 
NO:252, and where b is greater than or 
equal to a + 14. 


T64550, T65973, T94849, T94894, R07359, 
R07409, R34782. R35670. R35781, R56137, 
R56532, R64039, R66397, R67131, H01215, 
H02256, H02354, H03227, H04019, R94572, 
R94573, H51242, H60286, H65939, H72416, 
H72857, N22537,N24628, N24936, N33813, 
N35712, N35830, N35916. N43982, N51363, 
N64462, N70838, N75470, N75760, W01444, 
W05279, W57605, W58752, W72612, W72970, 
W73260, W73535, W76678, W76207, W94918, 
W9197I, W92319, W92355, AA024690, 
AA024643, AA028083, AA028084, AA028169, 
AA035743, AA045830, AA045917, AA081723, 
AA086310, AA085740, AA102651, AA101305, 
AA126788, AA126837, AA126865, AA127295, 
AA129688, AA129664, AA133503, AA133504, 
AA132801, AA134537, AA134547, AA186712, 
AA188264 AA215597 AA46'^077 17 
AA417286, AA417312, AA259228, AA279952, 
AA287814, AA468227, AA468302, AA526480, 
AA553703, AA587072, AA635683, AA639361, 
AA573471, AA579754, AA579812, AA580600, 
AA730425, AA741436, AA804629, AA829189, 
AA830255, AA865594, AA885821, AA918979, 
AA962033, AA985542, AA985571, AA987607, 
AA995783, A1075334, D79160, N84712, 
N88655, C03235, AA094028 


829282 


Preferably excluded from the present 
invention are one or more 
polynucleotides comprising a nucleotide 
sequence described by the general 
formula of a-b, where a is any integer 
between 1 to 1 1 1 1 of SEQ ID NO:253, 
0 is an integer of 1 5 to 1 125, where both 
a and b correspond to the positions of 
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nucleotide residues shown in SEQ ID 
NO:253, and where b is greater than or 
equal to a + 14. 




829368 


Preferably excluded from the present 
invention are one or more 
polynucleotides comprising a nucleotide 
sequence described by the general 
formula of a-b, where a is any integer 
between 1 to 1395 of SEQ IDNO:254, 
b is an integer of 15 to 1409, where both 
a and b correspond to the positions of 
nucleotide residues shown in SEQ ID 
NO:254, and where b is greater than or 
equal to a + 14. 


R61547, R76124. H01565, H02950, H04248, 
H29996, H99672, W 19970 


829751 


Preferably excluded from the present 
invention are one or more 
polynucleotides comprising a nucleotide 
sequence described by the general 
formula of a-b, where a is any integer 
between 1 to 476 of SEQ ID NO:255, b 
is an integer of 15 to 490, where both a 
and b correspond to the positions of 
nucleotide residues shown in SEQ ID 
NO:255, and where b is greater than or 
equal to a + 14. 




829773 


Preferably excluded from the present 
invention are one or more 
polynucleotides comprising a nucleotide 
sequence described by the general 
formula of a-b, where a is any integer 
between 1 to 1219 of SEQ ID NO:256, 
b is an integer of 15 to 1233, where both 
a and b correspond to the positions of 
nucleotide residues shown in SEQ ID 
NO:256, and where b is greater than or 
equal to a + 14. 


T96982, T97094, H53488, H53861, H64894, 
H65486, N62304, N67480, N78709, W03409, 
W07598, W73770, AA025496, AA025812, 
AA 1 33948 


829934 


Preferably excluded from the present 
invention are one or more 
polynucleotides comprising a nucleotide 
sequence described by the general 
formula of a-b, where a is any integer 
between 1 to 2390 of SEQ ID NO:257, 
b is an integer of 1 5 to 2404, where both 
a and b correspond to the positions of 
nucleotide residues shown in SEQ ID 
SJO:257, and where b is greater than or 
equal to a + 14. 




829942 


■Preferably excluded from the present 
nvention are one or more 
polynucleotides comprising a nucleotide 
sequence described by the general 
■ormula of a-b, where a is any integer 


T64541, T65964, R01423, R01424, R05277, 
R19450, R44699, R51779, R51780, R44699, 
H 1 1 322, H 1 1 349, H 1 3859, H 1 39 1 1 , H2 1 393, 
H21437, H21890, H221 17, H45982, H46047, 
H47137, R98886, H54491, H54854, H98744, 
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between I to 2078 of SEQ ID NO:258, 
b is an integer of 1 5 to 2092, where both 
a and b correspond to the positions of 
nucleotide residues shown in SEQ ID 
NO:258, and where b is greater than or 
equal to a + 14. 


N23465, N37080, N46155, N46396, N58995. 
N62715, N93640, W60228, W60227, W74349, 
W76544, VV87768, W87883, W90517, W90518, 
AA010775, AAOl 1055, AA029083, AA029084, 
AA036822, AA057660, AA075916, AA082814, 
AA101057, AA130702, AA132788, AA133063, 
AA147813, AA148063, AA15I487, AA151511, 
AA173298, AA173348, AA181036, AA187993, 
AA187994, AA192370, AA192357, AA243010, 
AA243264, AA250948 


829951 


Preferably excluded from the present 
invention are one or more 
polynucleotides comprising a nucleotide 
sequence described by the general 
formula of a-b, where a is any integer 
between 1 to 373 of SEQ ID NO:259, b 
is an integer of 15 to 387, where both a 
and b correspond to the positions of 
nucleotide residues shown in SEQ ID 
NO;259, and where b is greater than or 
equal to a + 14. 




830173 


Preferably excluded from the present 
invention are one or more 
polynucleotides comprising a nucleotide 
sequence described by the general 
formula of a-b, where a is any integer 
between I to 3698 of SEQ IDNO:260, 
b is an integer of 15 to 3712, where both 
a and b correspond to the positions of 
nucleotide residues shown in SEQ ID 
NO: 260, and where b is greater than or 
equal to a+ 14. 


T52493, T52572, T56913, T61268, T61320, 
T70063, T70130, T72005, T87844, T94182, 
T70248, R24534, R24639, R31200, R64161, 
R64274, R70751, R70750, H 16 1 89, H89274, 
H99749, N25430, N25537, N32578, N32816, 
N34 1 20, N34 1 34, N3449 1 , N3508 1 , N42260, 
N4382 1 , N62 1 52, N62798, N64065. N64 1 69, 
N67362, N69808, N74678, N93912. N49165, 
W04704, W05040, W 1 6565, W 19920, W3 1*806, 
W31907, W37354, W37355, W40493, W45266, 
W45455, W52925, W58628, W92222, W92345, 
N91265, AA027083, AA027124, AA028969, 
AA029137, AA029257, AA083657, AA084297, 
AA121 151, AA121 131, AA126957, AA127166, 
AA128353, AA128495, AA128834, AA132690, 
AA132783, AA136553, AA152414, AA150706, 
AA 150808, AA 156272, AA 1 64766, AA164767, 
AA171427, AA171794, AA173592, AA173949, 
AA 1 9042 1 , AA 1 90580, AA 1 9 1 3 83 , AA2244 1 5, 
AA232135 


830200 


Preferably excluded from the present 
invention are one or more 
polynucleotides comprising a nucleotide 
sequence described by the general 
*ormula of a-b, where a is any integer 
between 1 to 883 of SEQ ID NO:261, b 
is an integer of 1 5 to 897, where both a 
and b correspond to the positions of 
nucleotide residues shown in SEQ ID 
NO:261, and where b is greater than or 
squal to a + 14. 


AA524284, AA662477, AA887924 
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830365 


Preferably excluded from the present 
invention are one or more 
polynucleotides comprising a nucleotide 
sequence described by the general 
formula of a-b, where a is any integer 
between 1 to 1891 of SEQ ID NO:262, 
b is an integer of 15 to 1905, where both 
a and b correspond to the positions of 
nucleotide residues shown in SEQ ID 
NO:262, and where b is greater than or 
equal to a -h 14. 


a42905, R59718, R624I9, R72182, R72228, 
H22520, H22519, H25889, H45643, H46451, 
H46992, H84483, N50834, N92573, AA022699, 
AA022791, AA037734, AA037735, AA040585, 
AA040557, AA0478I6, AA159187, AA159282, 
AA223337, AA505391, AA51559K AA524466, 
AA613383, AA627298, AA578816, AA769153, 
AA826456, AA830896, AA83I083, AA837917, 
AA977053, AI083822. AI090301, A1084104 


830456 


Preferably excluded from the present 
invention are one or more 
polynucleotides comprising a nucleotide 
sequence described by the general 
fonmula of a-b, where a is any integer 
between 1 to 1410 of SEQ ID NO;263, 
b is an integer of 15 to 1424, where both 
a and b correspond to the positions of 
nucleotide residues shown in SEQ ID 
NO:263, and where b is greater than or 
equal to a + 14. 


T39800, T39875, T4033L T80148, ROl 135, 
R05754, R12866, R15287, R21703, R39361, 
H00652, H00741, H05366, H17706, H23423, 
R97800, R97849, N25478, N4 1 797, N485 1 1 , 
N98906, W 19893, W23945, W35I74, W60540, 
W78229, W79282, W84685, AA022952, 
AA026821, AA026953, AA074956, AA0751 1 1, 
AAl 14974, AAl 14988, AA 192860, AA 193064 


830549 


Preferably excluded from the present 
invention are one or more 
polynucleotides comprising a nucleotide 
sequence described- by the general 
formula of a-b, where a is any integer 
between I to 1273 of SEQ 1DN0:264, 
b is an integer of 15 to 1287, where both 
a and b correspond to the positions of 
nucleotide residues shown in SEQ ID 
NO:264, and where b is greater than or 
equal to a + 14, 


R60171, H26796, H96303, N91699, W25137, 
AA069218, AA088565. AA161 178 


830602 


Preferably excluded from the present 
invention are one or more 
polynucleotides comprising a nucleotide 
sequence described by the general 
formula of a-b, where a is any integer 
between 1 to 977 of SEQ ID NO:265, b 
is an integer of 1 5 to 99 1 , where both a 
and b correspond to the positions of 
nucleotide residues shown in SEQ ID 
NO:265, and where b is greater than or 
equal to a + 14. 




830610 


Preferably excluded from the present 
invention are one or more 
polynucleotides comprising a nucleotide 
sequence described by the general 
"ormula of a-b, where a is any integer 
between 1 to 2306 of SEQ ID NO:266, 
) is an integer of 15 to 2320, where both 
1 and b correspond to the positions of 
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nucleotide residues shown in SEQ ID 
NO;266, and where b is greater than or 
equal to a + 14. 




830644 


Preferably excluded from the present 
invention are one or more 
polynucleotides comprising a nucleotide 
sequence described by the general 
formula of a-b, where a is any integer 
between 1 to 409 of SEQ ID NO:267, b 
is an integer of 1 5 to 423, where both a 
and b correspond to the positions of 
nucleotide residues shown in SEQ ID 
NO:267, and where b is greater than or 
equal to a + 14. 




830707 


Preferably excluded from the present 
invention are one or more 
polynucleotides comprising a nucleotide 
sequence described by the general 
formula of a-b, where a is any integer 
between 1 to 1832 of SEQ ID NO:268, 
b is an integer of 1 5 to 1 846, where both 
a and b correspond to the positions of 
nucleotide residues shown in SEQ ID 
NO;268, and where b is greater than or 
equal to a+ 14, 




830709 


Preferably excluded from the present 
invention are one or more 
polynucleotides comprising a nucleotide 
sequence described by the general 
formula of a-b, where a is any integer 
between 1 to 587 of SEQ ID NO:269, b 
is an integer of 1 5 to 60 ! , where both a 
and b correspond to the positions of 
nucleotide residues shown in SEQ ID 
NO: 269, and where b is greater than or 
equal to a + 14. 




830733 


Preferably excluded from the present 
invention are one or more 
polynucleotides comprising a nucleotide 
sequence described by the general 
formula of a-b, where a is any integer 
between 1 to 866 of SEQ ID NO:270, b 
is an integer of 15 to 880, where both a 
and b correspond to the positions of 
nucleotide residues shown in SEQ ID 
NO:270, and where b is greater than or 
equal to a+ 14. 


T26638, R49962, H96664, N71762, N90691, 
AA040156 AA128271 AA4I8045 AA418216 
AA535799. AA583405, AA76881 1 


830768 


Preferably excluded from the present 
invention are one or more 
polynucleotides comprising a nucleotide 
sequence described by the general 
brmula of a-b, where a is any integer 
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between 1 to 2470 of SEQ ID NO:27l , 
b is an integer of 1 5 to 2484, where both 
a and b correspond to the positions of 
nucleotide residues shown in SEQ ID 
NO:271, and where b is greater than or 
equal to a + 14. 




830855 


Preferably excluded from the present 
invention are one or more 
polynucleotides comprising a nucleotide 
sequence described by the general 
formula of a-b, where a is any integer 
between 1 to 737 of SEQ ID NO:272, b 
is an integer of 1 5 to 75 1 , where both a 
and b correspond to the positions of 
nucleotide residues shown in SEQ ID 
NO:272, and where b is greater than or 
equal to a + 14. 


H 1 7 1 27, AA 1 003 1 1 , AA 1 1 29 1 0, AA282249, 
AA578649 AA748590 


830949 


Preferably excluded from the present 
invention are one or more 
polynucleotides comprising a nucleotide 
sequence described by the general 
formula of a-b, where a is any integer 
between 1 to 3295 of SEQ ID NO:273, 
b is an integer of 15 to 3309, where both 
a and b correspond to the positions of 
nucleotide residues shown in SEQ ID 
NO: 273, and where b is greater than or 
equal to a + 14. 




830965 


Preferably excluded from the present 
invention are one or more 
polynucleotides comprising a nucleotide 
sequence described by the general 
formula of a-b, where a is any integer 
between 1 to 829 of SEQ ID NO:274, b 
is an integer of 15 to 843. where both a 
and b correspond to the positions of 
nucleotide residues shown in SEQ ID 
NO:274, and where b is greater than or 
equal to a + 14. 




830973 


Preferably excluded from the present 
invention are one or more 
polynucleotides comprising a nucleotide 
sequence described by the general 
formula of a-b, where a is any integer 
between I to 2014 of SEQ ID NO:275, 
3 is an integer of 15 to 2028, where both 
a and b correspond to the positions of 
nucleotide residues shown in SEQ ID 
MO: 275, and where b is greater than or 
equal to a+ 14. 




830979 


r*referably excluded from the present 
nvention are one or more 
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polynucleotides comprising a nucleotide 
sequence described by the general 
formula of a-b, where a is any integer 
between 1 to 1441 ofSEQ IDNO:276, 
b is an integer of 15 to 1455, where both 
a and b correspond to the positions of 
nucleotide residues shown in SEQ ID 
NO:276, and where b is greater than or 
equal to a + 14. 




830989 


Preferably excluded from the present 
invention are one or more 
polynucleotides comprising a nucleotide 
sequence described by the general 
formula of a-b, where a is any integer 
between I to 1909 of SEQ ID NO:277, 
b is an integer of 1 5 to 1923, where both 
a and b correspond to the positions of 
nucleotide residues shown in SEQ ID 
NO;277, and where b is greater than or 
equal to a + 14. 




831134 


Preferably excluded from the present 
invention are one or more 
polynucleotides comprising a nucleotide 
sequence described by the general 
formula of a-b, where a is any integer 
between 1 to 1366 of SEQ ID NO:278, 
b is an integer of 1 5 to 1380, where both 
a and b correspond to the positions of 
nucleotide residues shown in SEQ ID 
lv[0:278, and where b is greater than or 
equal to a + 14. 




831200 


Preferably excluded from the present 
invention are one or more 
polynucleotides comprising a nucleotide 
sequence described by the general 
formula of a-b, where a is any integer 
between I to 1004 of SEQ ID NO: 279, 
b is an integer of 15 to 1018, where both 
a and b correspond to the positions of 
nucleotide residues shown in SEQ ID 
SfO:279, and where b is greater than or 
equal to a + 14. 




831260 


Preferably excluded from the present 
invention are one or more 
polynucleotides comprising a nucleotide 
sequence described by the general 
formula of a-b, where a is any integer 
between 1 to 1 178 of SEQ ID NO:280, 
> is an integer of 15 to 1 192, where both 
a and b correspond to the positions of 
nucleotide residues shown in SEQ ID 
MO:280, and where b is greater than or 


R15008, R28066, R68324, H20638, N25438, 
N67982, N67983, N67999, N68004, N68005, 
N80403, N80423, N80429, N80430, AA024581, 
AA024582, AA024637, AA862760, AA091 142 
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0-> 1 -) J I 


r ICLCiuUlY CALlUUCLi IIUIII IMC ^ItotUl 

invention are one or more 
polynucleotides comprising a nucleotide 
sequence described by the general 
formula of a-b, where a is any integer 
between 1 to 1741 ofSEQ IDNO:28L 
b is an integer of 1 5 to 1755, where both 
a and b con*espond to the positions of 
nucleotide residues shown in SEQ ID 
NO:28l, and where b is greater than or 
equal to a + 14. 


T66624. R16038, R26139, R26353, H 15795, 
H16285, H2I749, H21945, H22698, H23978, 
H52286, H52523, H60184, H60227, H68044, 
H8 1 748, H8 i 749. N46859, N47 1 79, N5 1 722, 
N5 1 808, AA03 1 70 1 , AA03 1 866, AA043760, 
AA04376K AA08I005, AA081 148, AA195519, 
AA470636, AA534463, AA555I98, AA631348, 
AA72I036, AA737025, AA761301, AA764993, 
AA765314, AA765749, AA878422, U47720, 
C2I223 


oj I DO J 


rreieraDiy exciuuea irom me pieseni 
invention are one or more 
polynucleotides comprising a nucleotide 
sequence described by the general 
formula of a-b, where a is any integer 
between 1 to 1079 of SEQ ID NO:282, 
b is an integer of 1 5 to 1093. where both 
a and b correspond to the positions of 
nucleotide residues shown in SEQ ID 
NO:282, and where b is greater than or 
equal to a + 14. 




07 1 7T/1 
OJ 1 /Z4 


Preferably excluded from the present 
invention are one or more 
polynucleotides comprising a nucleotide 
sequence described by the general 
formula of a-b, where a is any integer 
between 1 to 1542 of SEQ IDNO:283, 
b is an integer of 1 5 to 1 556, where both 
a and b correspond to the positions of 
nucleotide residues shown in SEQ ID 
NO:283, and where b is greater than or 
equal to a + 14. 


R52 1 6 1 , N45 1 79, N68350, N9402 1 , W02782, 
W24840 W61323 AA907441 


OJ loo** 


rreieraDiy exciuuea rrom inc present 
invention are one or more 
polynucleotides comprising a nucleotide 
sequence described by the general 
formula of a-b, where a is any integer 
between 1 to 1015 of SEQ ID NO:284, 
? is an integer of 15 to 1029, where both 

a find h pniTf^^innnfl to the nn<*itinn^ of 
nucleotide residues shown in SEQ ID 
MO:284, and where b is greater than or 
equal to a + 14. 




831897 


Preferably excluded from the present 
invention are one or more 
polynucleotides comprising a nucleotide 
sequence described by the general 
formula of a-b, where a is any integer 
between I to 1569 of SEQ IDNO:285, 
0 is an integer of 15 to 1583, where both 


AA056348, AA127534 
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a and b correspond to the positions of 
nucleotide residues shown in SEQ ID 
NO:285, and where b is greater than or 
equal to a + 14. 




831922 


Preferably excluded from the present 
invention are one or more 
polynucleotides comprising a nucleotide 
sequence described by the general 
formula of a-b, where a is any integer 
between 1 to 1 163 of SEQ ID NO:286. 
b is an integer of 15 to 1 177, where both 
a and b correspond to the positions of 
nucleotide residues shown in SEQ ID 
NO:286, and where b is greater than or 
equal to a + 14. 




831963 


Preferably excluded from the present 
invention are one or more 
polynucleotides comprising a nucleotide 
sequence described by the general 
formula of a-b, where a is any integer 
between 1 to 492 of SEQ ID NO:287, b 
is an integer of 15 to 506, where both a 
and b correspond to the positions of 
nucleotide residues shown in SEQ ID 
NO:287, and where b is greater than or 
equal to a + 14. 




832074 


Preferably excluded from the present 
invention are one or more 
polynucleotides comprising a nucleotide 
sequence described by the general 
formula of a-b, where a is any integer 
between 1 to 934 of SEQ ID NO:288, b 
is an integer of 15 to 948, where both a 
and b correspond to the positions of 
nucleotide residues shown in SEQ ID 
NO:288, and where b is greater than or 
equal to a + 14, 




832266 


Preferably excluded from the present 
invention are one or more 
polynucleotides comprising a nucleotide 
sequence described by the general 
brmula of a-b, where a is any integer 
between 1 to 1020 of SEQ ID NO:289, 
b is an integer of 15 to 1034, where both 
a and b correspond to the positions of 
nucleotide residues shown in SEQ ID 
>JO:289, and where b is greater than or 
equal to a + 14. 


T70612, T70879, Hi 3555, H23264, R97792, 
R97842, N75850, W07434, W 19866, N90056, 
AA043395, AA463232, AA463231 


832309 


Preferably excluded from the present 
invention are one or more 
polynucleotides comprising a nucleotide 
sequence described by the general 
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formula of a-b, where a is any integer 
between 1 to 3077 of SEQ ID NO:290, 
b is an integer of 1 5 to 3091 . where both 
a and b correspond to the positions of 
nucleotide residues shown in SEQ ID 
NO:290, and where b is greater than or 
equal to a + 14. 




832342 


Preferably excluded from the present 
invention are one or more 
polynucleotides comprising a nucleotide 
sequence described by the general 
formula of a-b, where a is any integer 
between 1 to 504 of SEQ ID NO:29 1 . b 
is an integer of 1 5 to 5 1 8, where both a 
and b correspond to the positions of 
nucleotide residues shown in SEQ ID 
NO:29I, and where b is greater than or 
equal to a + 14. 




832351 


Preferably excluded from the present 
invention are one or more 
polynucleotides comprising a nucleotide 
sequence described by the general 
formula of a-b, where a is any integer 
between 1 to 484 of SEQ ID NO:292, b 
is an integer of 15 to 498, where both a 
and b correspond to the positions of 
nucleotide residues shown in SEQ ID 
NO:292, and where b is greater than or 
equal to a + 14. 




832352 


t*referably excluded from the present 
invention are one or more 
polynucleotides comprising a nucleotide 
sequence described by the general 
formula of a-b, where a is any integer 
between I to 455 of SEQ ID NO:293, b 
is an integer of 1 5 to 469, where both a 
and b correspond to the positions of 
nucleotide residues shown in SEQ ID 
NO:293, and where b is greater than or 
equaltoa-f 14. 




832434 


Preferably excluded from the present 
invention are one or more 
polynucleotides comprising a nucleotide 
sequence described by the general 
formula of a-b, where a is any integer 
between 1 to 654 of SEQ ID NO:294, b 
rs an integer of 15 to 668, where both a 
dnd b correspond to the positions of 
nucleotide residues shown in SEQ ID 
NO:294, and where b is greater than or 
equal to a + 14. 




832490 


Preferably excluded from the present 


T86496, H24346, R84505, N26874, N9862I, 
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invention are one or more 
polynucleotides comprising a nucleotide 
sequence described by the general 
formula of a-b. where a is any integer 
between 1 to 13S6 of SEQ ID NO:295. 
b is an integer of 15 to 1400, where both 
a and b correspond to the positions of 
nucleotide residues shown in SEQ ID 
NO:295, and where b is greater than or 
equal to a + 14. 


W04678, W04692, W24267, W93387, W94971, 
AA036953, AA 1 36869, AM 36799, AA 1 472 14, 
AA 1604 13, AA535592, AA93 1261 , AA93 1403, 
AA962726. AA992456 


832573 


Preferably excluded from the present 
invention are one or more 
polynucleotides comprising a nucleotide 
sequence described by the general 
formula of a-b, where a is any integer 
between 1 to 946 of SEQ ID NO:296. b 
is an integer of 15 to 960, where both a 
and b correspond to the positions of 
nucleotide residues shown in SEQ ID 
NO:296, and where b is greater than or 
equal to a + 14, 




832580 


Preferably excluded from the present 
invention are one or more 
polynucleotides comprising a nucleotide 
sequence described by the general 
formula of a-b, where a is any integer 
between 1 to 643 of SEQ ID NO:297, b 
is an integer of 15 to 657, where both a 
and b correspond to the positions of 
nucleotide residues shown in SEQ ID 
NO:297, and where b is greater than or 
equal to a + 14. 




833394 


Preferably excluded from the present 
invention are one or more 
polynucleotides comprising a nucleotide 
sequence described by the general 
formula of a-b, where a is any integer 
between 1 to 878 of SEQ ID NO:298, b 
is an integer of 15 to 892, where both a 
and b correspond to the positions of 
nucleotide residues shown in SEQ ID 
hJO:298, and where b is greater than or 
equal to a + 14. 




835355 


Preferably excluded from the present 
invention are one or more 
polynucleotides comprising a nucleotide 
sequence described by the general 
formula of a-b. where a is any integer 
between 1 to 1610 of SEQ ID NO:299, 
3 is an integer of 15 to 1624, where both 
a and b correspond to the positions of 
nucleotide residues shown in SEQ ID 


AA076638, AA9i6592, AI088936, AI089690 
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NO:299, and where b is greater than or 
equal to a + 14. 




835497 


Preferably excluded from the present 
invention are one or more 
polynucleotides comprising a nucleotide 
sequence described by the general 
formula of a-b, where a is any integer 
between I to 1955 of SEQ IDNO:300, 
b is an integer of 15 to 1969. where both 
a and b correspond to the positions of 
nucleotide residues shown in SEQ ID 
NO:300, and where b is greater than or 
equal to a + 14. 




835728 


Preferably excluded from the present 
invention are one or more 
polynucleotides comprising a nucleotide 
sequence described by the general 
formula of a-b, where a is any integer 
between 1 to 1868 of SEQ IDNO:30K 
b is an integer of 1 5 to 1882, where both 
a and b correspond to the positions of 
nucleotide residues shown in SEQ ID 
NO:301, and where b is greater than or 
equal to a + 14. 




835978 


Preferably excluded from the present 
invention are one or more 
polynucleotides comprising a nucleotide 
sequence described by the general 
formula of a-b, where a is any integer 
between 1 to 2790 of SEQ ID NO:302, 
b is an integer of 15 to 2804, where both 
a and b correspond to the positions of 
nucleotide residues shown in SEQ ID 
NO:302, and where b is greater than or 
equal to a + 14. 




836091 


Preferably excluded from the present 
invention are one or more 
polynucleotides comprising a nucleotide 
sequence described by the general 
'ormula of a-b, where a is any integer 
between I to 3845 of SEQ IDNO:303, 
0 is an integer of 15 to 3859, where both 
a and b correspond to the positions of 
nucleotide residues shown in SEQ ID 
MO:303, and where b is greater than or 
equal to a + 14. 


^02093, R02205, R02336, R02439, R19436, 
R44685, R44685, R72354, HI0i6O, H49884, 
H49885, N23208, N28789, N2990 1 , N42953, 
N55093, N77305, N99373, W46396, W46504, 
AA08231 1, AA 176281, AA 176282, AA227971, 
AA228079, AA234964, AA234145, AA281787, 
AA28I656, AA524468, AA551888, AA631 173, 
AA639499, AA81 1344, AA830439, AA831974, 
AA923665, C03439, AA641655, AA091346, 
AA400968, AA400884 


836274 


Preferably excluded from the present 
invention are one or more 
polynucleotides comprising a nucleotide 
sequence described by the general 
'ormula of a-b, where a is any integer 
between I to 3364 of SEQ ID NO:304, 


T75442, R20393, R435 1 1 . R435 1 1 , R73650, 
R7373K R80152, R80886, H97932, H98616, 
N330I8, N71679, N99650, AA001053, 
AA001089, AA044947, A A044943, AA 149057, 
AA464856, AA427892, AA228265, AA230021, 
AA482694, AA483691, AA484850, AA5 13037, 
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b is an integer of 15 to 3378, where both 
a and b correspond to the positions of 
nucleotide residues shown in SEQ ID 
NO:304, and where b is greater than or 
equal to a + 14. 


AA5 16076. AA532381, AA583355, AA6I8566, 
AA577028. AA730651, AA730790, AA745667, 
AA829807, AA923038, AA931937, AA932867, 
AA934400, AA9344 1 3, AA97 1551, AA97 1 743, 
AA972772. AA977253, AA992454, AA994794, 
AI089906, AI09492I, D7928I, C06099, 
D44840, C20741, AA283I86, AA292346, 
AA394I64 


836731 


Preferably excluded from the present 
invention are one or more 
polynucleotides comprising a nucleotide 
sequence described by the general 
formula of a-b, where a is any integer 
between 1 to 1000 of SEQ ID NO:305, 
b is an integer of 15 to 1014, where both 
a and b correspond to the positions of 
nucleotide residues shown in SEQ ID 
NO:305, and where b is greater than or 
equal to a + 14. 




838014 


Preferably excluded from the present 
invention are one or more 
polynucleotides comprising a nucleotide 
sequence described by the general 
formula of a-b, where a is any integer 
between 1 to 2 1 1 3 of SEQ ID NO:306, 
b is an integer of 1 5 to 2 127, where both 
a and b correspond to the positions of 
nucleotide residues shown in SEQ ID 
NO:306, and where b is greater than or 
equal to a + 14. 




838874 


Preferably excluded from the present 
invention are one or more 
polynucleotides comprising a nucleotide 
sequence described by the general 
formula of a-b, where a is any integer 
between 1 to 652 of SEQ ID NO:307, b 
is an integer of 15 to 666, where both a 
and b correspond to the positions of 
nucleotide residues shown in SEQ ID 
MO;307, and where b is greater than or 
equal to a + 14. 


R61165, N44200 


839120 


Preferably excluded from the present 
invention are one or more 
polynucleotides comprising a nucleotide 
sequence described by the general 
'ormula of a-b, where a is any integer 
between 1 to 2 1 57 of SEQ ID NO:308, 
b is an integer of 15 to 2171, where both 
1 and b correspond to the positions of 
nucleotide residues shown in SEQ ID 
SIO:308, and where b is greater than or 
jqual to a + 14. 


T74462, R18264, H23432, AA279685, 
AA847441, AA904076, AA393782 



wo 00/55173 



!44 



PCT/USOO/05881 



839611. 


Preferably excluded from the present 
invention are one or more 
polynucleotides comprising a nucleotide 
sequence described by the general 
formula of a-b, where a is any integer 
between I to 6149 of SEQ ID NO:309, 
b is an integer of 15 to 6163, where both 
a and b coirespond to the positions of 
nucleotide residues shown in SEQ ID 
NO:309, and where b is greater than or 
equal to a + 14. 


T93695, T93696, T96l6i, R32227, R32254, 
R32304, R33503, R34044. R71 178, H93366, 
N50709, N55039, AA165143, AA199856, 
AA 1 99927, AA23433 1 . A A262892, AA423987, 
AA423986. AA525886. AA661602, AA73I504! 
AA741228, AA814795, AA828858! AA829196, 
AA831 198, AA834822, AA865590, AA886436, 
AA903649, D82270, D82453, D82464, 
AA642466, AA2 19620, AA2 19628, AA400707, 
AA400674, AA42 1941 , AA633988, AA663219, 
AA663250, AA665538, AA724260, AI0747I4, 
T26891,T26926 


840138 


Preferably excluded from the present 
invention are one or more 
polynucleotides comprising a nucleotide 
sequence described by the general 
formula of a-b, where a is any integer 
between 1 to 2072 of SEQ ID NO:310, 
b is an integer of 1 5 to 2086. where both 
a and b correspond to the positions of 
nucleotide residues shown in SEQ ID 
NO;3I0, and where b is greater than or 
equal to a + 14. 




840616 


Preferably excluded from the present 
invention are one or more 
polynucleotides comprising a nucleotide 
sequence described by the general 
formula of a-b, where a is any integer 
between I to 2 1 49 of SEQ ID N0:3 1 1 , 
b is an integer of 1 5 to 2 1 63, where both 
a and b correspond to the positions of 
nucleotide residues shown in SEQ ID 
N0:311, and where b is greater than or 
equal to a + 14. 




840780 


Preferably excluded from the present 
invention are one or more 
polynucleotides comprising a nucleotide 
sequence described by the general 
formula of a-b, where a is any integer 
between I to 1383 of SEQ IDNO:312, 
b is an integer of 15 to 1397, where both 
a and b correspond to the positions of 
nucleotide residues shown in SEQ ID 
M0:3 12, and where b is greater than or 
equal to aH- |4. 




840857 


Preferably excluded from the present 
invention are one or more 
polynucleotides comprising a nucleotide 
sequence described by the general 
'ormula of a-b, where a is any integer 
between I to 4092 of SEQ ID NO:313, 
b is an integer of 15 to 4106, where both 


T50389, T50520, T554I9, T55495, T55974, 
T57220, R3459I, R34592, R69726, H21 148, 
R85777, R99233, H6131 1. H62351, H85185, 
H88299, N23288, N32662, N58504, N78093, 
N92665, N9961 1, AA005068, AA007333, 
AA007334, AA036884, AA044715, AA045458, 
AA046500, AA045654. AA 1 15936, AA 12 1004, 
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a and b correspond to the positions of 
nucleotide residues shown in SEQ ID 
N0:3 13, and where b is greater than or 
equal to a + 14. 


AA126775, AAI33605, AA133606, AAI33980, 
AA181633. AA18261 L AA232979. AA233365, 
AA459953, AA460042, AA282826, AA285050, 
AA506082, AA558006, AA601060, AA767799, 
AA804323, AA807029, AA807087, AA825536, 
AA8338ia AA922732, AA928638, AA960990, 
N56482, N62047, W27456, W26569, 
AA092778. AA652535, AA065256, AA065257, 
AA450197, AA452846, AA452986, AA705224, 
Z 19460, AA884767. AA969488, AA977494, 
A1002996, AI032008, Z28526, D201 12, T19336 


840862 


Preferably excluded from the present 
invention are one or more 
polynucleotides comprising a nucleotide 
sequence described by the general 
formula of a-b, where a is any integer 
between 1 to 518 of SEQ IDN0;3H,b 
is an integer of 15 to 532, where both a 
and b correspond to the positions of 
nucleotide residues shown in SEQ ID 
N0:3 14, and where b is greater than or 
equal to a + 14. 


194528. N40545, N46592, N92934, AA570273, 
AA873604, AA9 10827, AA932397, AA971868, 
A10952 1 0, N56229, AA648290, F20835, 
AA629912 


840864 


Preferably excluded from the present 
invention are one or more 
polynucleotides comprising a nucleotide 
sequence described by the general 
formula of a-b, where a is any integer 
between 1 to 1924 of SEQ IDNO:315, 
b is an integer of 15 to 1938, where both 
a and b correspond to the positions of 
nucleotide residues shown in SEQ ID 
N0:3 15, and where b is greater than or 
equal to a+ 14. 


R40870, R44820, H26640, W78814, W807I3, 
AA 195492, AA937549, A1085492, AI094865, 
AA4493I7, AA8846G0, AA909529, AA923452, 
AA971781, AI084795. AI089007, AA702758, 
AA702769 


840936 


Preferably excluded from the present 
invention are one or more 
polynucleotides comprising a nucleotide 
sequence described by the general 
formula of a-b, where a is any integer 
between I to 804 of SEQ ID NO:316, b 
is an integer of 1 5 to 8 1 8, where both a 
and b correspond to the positions of 
nucleotide residues shown in SEQ ID 
NO:316, and where b is greater than or 
equal to a + 14. 




840938 


Preferably excluded from the present 
invention are one or more 
polynucleotides comprising a nucleotide 
sequence described by the general 
formula of a-b, where a is any integer 
between i to 823 of SEQ IDNO:317, b 
s an integer of 1 5 to 837, where both a 
ind b correspond to the positions of 
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nucleotide residues shown in SEQ ID 
N0:3 1 7, and where b is greater than or 
equal to a + 14. 




841884 


Preferably excluded from the present 
invention are one or more 
polynucleotides comprising a nucleotide 
sequence described by the general 
formula of a-b, where a is any integer 
between 1 to 1434 of SEQ IDNO:318. 
b is an integer of 15 to 1448. where both 
a and b correspond to the positions of 
nucleotide residues shown in SEQ ID 
N0:3 1 8, and where b is greater than or 
equal to a + 14. 




84224 i 


Preferably excluded from the present 
invention are one or more 
polynucleotides comprising a nucleotide 
sequence described by the general 
formula of a-b., where a is any integer 
between 1 to 1479 of SEQ ID NO:319, 
b is an integer of 1 5 to 1493, where both 
a and b correspond to the positions of 
nucleotide residues shown in SEQ ID 
N0:3 19, and where b is greater than or 
equal to a + 14. 




843712 


Preferably excluded from the present 
invention are one or more 
polynucleotides comprising a nucleotide 
sequence described by the general 
formula of a-b, where a is any integer 
between 1 to 595 of SEQ ID NO:320, b 
is an integer of 1 5 to 609, where both a 
and b correspond to the positions of 
nucleotide residues shown in SEQ ID 
NO:320, and where b is greater than or 
equal to a + 14. 


R0229I, N94598, W85882, AA255975 


844040 


Preferably excluded from the present 
invention are one or more 
polynucleotides comprising a nucleotide 
sequence described by the general 
formula of a-b, where a is any integer 
between 1 to 488 of SEQ IDNO:32i,b 
is an integer of 15 to 502, where both a 
and b correspond to the positions of 
nucleotide residues shown in SEQ ID 
NO:321, and where b is greater than or 
equal to a + 14, 


W24428, AA143434, AA459809 


844336 


Preferably excluded from the present 
mvention are one or more 
polynucleotides comprising a nucleotide 
sequence described by the general 
formula of a-b, where a is any integer 
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between 1 to 2616 of SEQ ID NO:322, 
b is an integer of 1 5 to 2630. where both 
a and b correspond to the positions of 
nucleotide residues shown in SEQ ID 
NO:322, and where b is greater than or 
equal to a + [4. 




844612 


Preferably excluded from the present 

lllVdlllL'll die UllC Ul IIIV/IC 

polynucleotides comprising a nucleotide 
sequence described by the general 
formula of a-b, where a is any integer 
between 1 to I860 of SEQ IDNO:323, 
b is an integer of 1 5 to 1 874, where both 
a and b correspond to the positions of 
nucleotide residues shown in SEQ ID 
NO:323, and where b is greater than or 
equal to a + 14. 




844617 


Preferably excluded from the present 
invention are one or more 
polynucleotides comprising a nucleotide 
sequence described by the general 
formula of a-b, where a is any integer 
between I to 23 1 1 of SEQ ID NO:324, 
b is an integer of 1 5 to 2325, where both 
a and b correspond to the positions of 
nucleotide residues shown in SEQ ID 
NO:324, and where b is greater than or 
equal to a + 14. 




845251 


Preferably excluded from the present 

invention are nnp nr mnrp 

polynucleotides comprising a nucleotide 
sequence described by the general 
formula of a-b, where a is any integer 
between 1 to 771 of SEQ ID NO:325, b 
is an integer of 15 to 785, where both a 
and b correspond to the positions of 
nucleotide residues shown in SEQ ID 
NO:325, and where b is greater than or 
equal to a+ 14. 


168474, AA 159 183, AA464447, AA424290, 
AA4'?44S7 AA/^'^17Q'^ AA09R'^0n AAQdMl] 

AA975194, AA977141 , AA430527. AA4306I2, 
AA477798 


845764 

* 


Preferably excluded from the present 
invention are one or more 
polynucleotides comprising a nucleotide 
sequence described by the general 
formula of a-b, where a is any integer 
between I to 230 of SEQ ID NO:326, b 
s an integer of 1 5 to 244, where both a 
and b correspond to the positions of 
nucleotide residues shown in SEQ ID 
MO:326, and where b is greater than or 
jqual to a + 14. 




846187 

1 


Preferably excluded from the present 
nvention are one or more 
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polynucleotides comprising a nucleoiide 

sequence described by the general 

formula of a-b, where a is any integer 

between I to 2440 of SEQ ID NO:327, 

b is an integer of 1 5 to 2454. where both 

a and b con*espond to the positions of 

nucleotide residues shown in SEQ ID 

NO:327, and where b is greater than or 

|equai to a 4- 14. 

Polynucleotide and Polypeptide Variants 

The present invention is directed to variants of the polynucleotide sequence disclosed 

in SEQ ID NO:X or the complementary strand thereto, and/or the cDNA sequence contained 

in a cDNA clone contained in the deposit. 

5 The present invention also encompasses variants of the breast, ovarian, breast cancer 

and/or ovarian cancer polypeptide sequence disclosed in SEQ ID NO:Y, a polypeptide 

sequence encoded by the polynucleotide sequence in SEQ ID NO:X, and/or a polypeptide 

sequence encoded by the cDNA in the related cDNA clone contained in the deposit. 

"Variant" refers to a polynucleotide or polypeptide differing from the polynucleotide 

10 or polypeptide of the present invention, but retaining essential properties thereof Generally, 

variants are overall closely similar, and, in many regions, identical to the polynucleotide or 

polypeptide of the present invention. 

The present invention is also directed to nucleic acid molecules which comprise, or 

altematively consist of, a nucleotide sequence which is at least 80%, 85%, 90%, 95%, 96%, 

15 97%, 98%, 99% or 100%, identical to, for example, the nucleotide coding sequence in SEQ 
ID NO:X or the complementary strand thereto, the nucleotide coding sequence of the related 
cDNA contained in a deposited library or the complementary strand thereto, a nucleotide 
sequence encoding the polypeptide of SEQ ID NO:Y, a nucleotide sequence encoding a 
polypeptide sequence encoded by the nucleotide sequence in SEQ ID NO:X, a nucleotide 

20 sequence encoding the polypeptide encoded by the cDNA in the related cDNA contained in a 
deposited library, and/or polynucleotide fragments of any of these nucleic acid molecules 
(e.g., those fragments described herein). Polypeptides encoded by these nucleic acid 
molecules are also encompassed by the invention. In another embodiment, the invention 
encompasses nucleic acid molecules which comprise or alternatively consist of, a 

25 polynucleotide which hybridizes under stringent hybridization conditions, or alternatively, 
under low stringency conditions, to the nucleotide coding sequence in SEQ ID NO:X, the 
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nucleotide coding sequence of the related cDNA clone contained in a deposited library, a 
nucleotide sequence encoding the polypeptide of SEQ ID NO:Y, a nucleotide sequence 
encoding a polypeptide sequence encoded by the nucleotide sequence in SEQ ID NO:X, a 
nucleotide sequence encoding the polypeptide encoded by the cDNA in the related cDNA 
5 clone contained in a deposited library, and/or polynucleotide fragments of any of these 
nucleic acid molecules (e.g., those fragments described herein). Polynucleotides which 
hybridize to the complement of these nucleic acid molecules under stringent hybridization 
conditions or alternatively, under lower stringency conditions, are also encompassed by the 
invention, as are polypeptides encoded by these polynucleotides. 

10 The present invention is also directed to polypeptides which comprise, or alternatively 

consist of, an amino acid sequence which is at least 80%, 85%, 90%, 95%, 96%, 97%, 98%, 
99% or 100% identical to, for example, the polypeptide sequence shown in SEQ ID NO:Y, a 
polypeptide sequence encoded by the nucleotide sequence in SEQ ID NO:X, a polypeptide 
sequence encoded by the cDNA in the related cDNA clone contained in a deposited library, 

15 and/or polypeptide fragments of any of these polypeptides (e.g., those fragments described 
herein). Polynucleotides which hybridize to the complement of the nucleic acid molecules 
encoding these polypeptides under stringent hybridization conditions, or alternatively, under 
lower stringency conditions, are also encompassed by the invention, as are polypeptides 
encoded by these polynucleotides. 

20 By a nucleic acid having a nucleotide sequence at least, for example, 95% "identical" 

to a reference nucleotide sequence of the present invention, it is intended that the nucleotide 
sequence of the nucleic acid is identical to the reference sequence except that the nucleotide 
sequence may include up to five point mutations per each 100 nucleotides of the reference 
nucleotide sequence encoding the polypeptide. In other words, to obtain a nucleic acid 

25 having a nucleotide sequence at least 95% identical to a reference nucleotide sequence, up to 
5% of the nucleotides in the reference sequence may be deleted or substituted with another 
nucleotide, or a number of nucleotides up to 5% of the total nucleotides in the reference 
sequence may be inserted into the reference sequence. The query sequence may be, for 
example, an entire sequence referred to in Table 1 , an ORF (open reading frame), or any 

30 fragment specified as described herein. 

As a practical matter, whether any particular nucleic acid molecule or polypeptide is 
at least 80%, 85%, 90%, 95%, 96%, 97%, 98% or 99% identical to a nucleotide sequence of 
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the present invention can be determined conventionally using known computer programs. A 
preferred method for determining the best overall match between a query sequence (a 
sequence of the present invention) and a subject sequence, also referred to as a global 
sequence alignment, can be determined using the FASTDB computer program based on the 
5 algorithm of Brutlag et al. (Comp. App. Biosci. 6:237-245 (1990)). In a sequence alignment 
the query and subject sequences are both DNA sequences. An RNA sequence can be 
compared by converting U's to T's. The result of said global sequence alignment is in 
percent identity. Preferred parameters used in a FASTDB alignment of DNA sequences to 
calculate percent identiy are: Matrix=Unitary, k-tuple=4, Mismatch Penalty=l, Joining 
10 Penalty=30, Randomization Group Length=0, Cutoff Score=l, Gap Penalty=5, Gap Size 
Penalty 0.05, Window Size=500 or the lenght of the subject nucleotide sequence, whichever 
is shorter. 

If the subject sequence is shorter than the query sequence because of 5' or 3' 
deletions, not because of internal deletions, a manual correction must be made to the results. 

15 This is because the FASTDB program does not account for 5' and 3' truncations of the 
subject sequence when calculating percent identit>'. For subject sequences truncated at the 5' 
or 3' ends, relative to the query sequence, the percent identity is corrected by calculating the 
number of bases of the query sequence that are 5' and 3' of the subject sequence, which are 
not matched/aligned, as a percent of the total bases of the query sequence. Whether a 

20 nucleotide is matched/aligned is determined by results of the FASTDB sequence alignment. 
This percentage is then subtracted from the percent identity, calculated by the above 
FASTDB program using the specified parameters, to arrive at a final percent identity score. 
This corrected score is what is used for the purposes of the present invention. Only bases 
outside the 5' and 3' bases of the subject sequence, as displayed by the FASTDB alignment, 

25 which are not matched/aligned with the query sequence, are calculated for the purposes of 
manually adjusting the percent identity score. 

For example, a 90 base subject sequence is aligned to a 100 base query sequence to 
determine percent identity. The deletions occur at the 5' end of the subject sequence and 
therefore, the FASTDB alignment does not show a matched/alignment of the first 10 bases at 

30 5' end. The 10 unpaired bases represent 10% of the sequence (number of bases at the 5' and 
3' ends not matched/total number of bases in the query sequence) so 10% is subtracted from 
the percent identity score calculated by the FASTDB program. If the remaining 90 bases 
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were perfectly matched the final percent identity would be 90%. In another example, a 90 
base subject sequence is compared with a 100 base query sequence. This time the deletions 
are internal deletions so that there are no bases on the 5' or 3' of the subject sequence which 
are not matched/aligned with the query. In this case the percent identity calculated by 
5 FASTDB is not manually corrected. Once again, only bases 5' and 3' of the subject sequence 
which are not matched/aligned with the query sequence are manually corrected for. No other 
manual corrections are to made for the purposes of the present invention. 

By a polypeptide having an amino acid sequence at least, for example, 95% 
"identical" to a query amino acid sequence of the present invention, it is intended that the 

10 amino acid sequence of the subject polypeptide is identical to the query sequence except that 
the subject polypeptide sequence may include up to five amino acid alterations per each 100 
amino acids of the query amino acid sequence. In other words, to obtain a polypeptide 
having an amino acid sequence at least 95% identical to a query amino acid sequence, up to 
5% of the amino acid residues in the subject sequence may be inserted, deleted, (indels) or 

15 substituted with another amino acid. These alterations of the reference sequence may occur 
at the amino or carboxy terminal positions of the reference amino acid sequence or anywhere 
between those terminal positions, interspersed either individually among residues in the 
reference, sequence or in one or more contiguous groups within the reference sequence. 

As a practical matter, whether any particular polypeptide is at least 80%, 85%, 90%, 

20 95%, 96%, 97%, 98% or 99% identical to, for instance, the amino acid sequence in SEQ ID 
NO:Y or a fragment thereof, the amino acid sequence encoded by the nucleotide sequence in 
SEQ ID NO:X or a fragment thereof, or the amino acid sequence encoded by the cDNA in 
the related cDNA clone contained in a deposited library, or a fragment thereof, can be 
determined conventionally using known computer programs. A preferred method for 

25 determing the best overall match between a query sequence (a sequence of the present 
invention) and a subject sequence, also referred to as a global sequence alignment, can be 
determined using the FASTDB computer program based on the algorithm of Brut lag et al. 
(Comp. App. Biosci. 6:237- 245(1990)). In a sequence alignment the query and subject 
sequences are either both nucleotide sequences or both amino acid sequences. The result of 

30 said global sequence alignment is in percent identity. Preferred parameters used in a 
FASTDB amino acid alignment are: Matrix=PAM 0, k-tuple=2. Mismatch Penalty=l, 
Joining Penalty=20, Randomization Group Length=0, Cutoff Score=I, Window 
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Size=sequence length, Gap Penalty=5, Gap Size Penalty=0.05, Window Size=500 or the 
length of the subject amino acid sequence, whichever is shorter. 

If the subject sequence is shorter than the query sequence due to N- or C-terminal 
deletions, not because of internal deletions, a manual correction must be made to the results. 
5 This is because the FASTDB program does not account for N- and C-terminal tiamcations of 
the subject sequence when calculating global percent identity. For subject sequences 
truncated at the N- and C-termini. relative to the query sequence, the percent identity is 
corrected by calculating the number of residues of the query sequence that are N- and C- 
terminal of the subject sequence, which are not matched/aligned with a corresponding subject 
10 residue, as a percent of the total bases of the query sequencie. Whether a residue is 
matched/aligned is determined by results of the FASTDB sequence alignment. This 
percentage is then subtracted from the percent identity, calculated by the above FASTDB 
program using the specified parameters, to arrive at a final percent identity score. This final 
percent identity score is what is used for the purposes of the present invention. Only residues 

15 to the N- and C-termini of the subject sequence, which are not matched/aligned with the 
query sequence, are considered for the purposes of manually adjusting the percent identity 
score. That is, only query residue positions outside the farthest N- and C- terminal residues 
of the subject sequence. 

For example, a 90 amino acid residue subject sequence is aligned with a 100 residue 

20 query sequence to determine percent identity. The deletion occurs at the N-terminus of the 
subject sequence and therefore, the FASTDB alignment does not show a matching/alignment 
of the first 10 residues at the N-terminus. The 10 unpaired residues represent 10% of the 
sequence (number of residues at the N- and C- termini not matched/total number of residues 
in the query sequence) so 10% is subtracted from the percent identity score calculated by the 

25 FASTDB program. If the remaining 90 residues were perfectly matched the final percent 
identity would be 90%, In another example, a 90 residue subject sequence is compared with 
a 100 residue query sequence. This time the deletions are internal deletions so there are no 
residues at the N- or C-termini of the subject sequence which are not matched/aligned with 
the query. In this case the percent identity calculated by FASTDB is not manually corrected. 

30 Once again, only residue positions outside the N- and C-terminal ends of the subject 
sequence, as displayed in the FASTDB alignment, which are not matched/aligned with the 
query sequence are manually corrected for. No other manual corrections are to made for the 



wo 00/55173 



PCT/USOO/05881 



153 

purposes of the present invention. 

The variants may contain alterations in the coding regions, non-coding regions, or 
both. Especially preferred are polynucleotide variants containing alterations which produce 
silent substitutions, additions, or deletions, but do not alter the properties or activities of the 
5 encoded polypeptide. Nucleotide variants produced by silent substitutions due to the 
degeneracy of the genetic code are preferred. Moreover, variants in which less than 50, less 
than 40, less than 30, less than 20, less than 10, or 5-50, 5-25, 5-10, 1-5, or 1-2 amino acids 
are substituted, deleted, or added in any combination are also preferred. Polynucleotide 
variants can be produced for a variety of reasons, e.g., to optimize codon expression for a 
10 particular host (change codons in the human mRNA to those preferred by a bacterial host 
such as E. coli). 

NaUirally occurring variants are called "allelic variants," and refer to one of several 
alternate forms of a gene occupying a given locus on a chromosome of an organism. (Genes 
II, Lewin, B., ed., John Wiley & Sons, New York (1985).) These allelic variants can vary at 

15 either the polynucleotide and/or polypeptide level and are included in the present invention. 
Alternatively, non-naturally occurring variants may be produced by mutagenesis techniques 
or by direct synthesis. 

Using known methods of protein engineering and recombinant DNA technology, 
variants may be generated to improve or alter the characteristics of the polypeptides of the 

20 present invention. For instance, as discussed herein, one or more amino acids can be deleted 
from the N-terminus or C-terminus of the polypeptide of the present invention without 
substantial loss of biological function. The authors of Ron et al., J. Biol. Chem. 268: 2984- 
2988 (1993), reported variant KGF proteins having heparin binding activity even after 
deleting 3, 8, or 27 amino-terminal amino acid residues. Similarly, Interferon gamma 

25 exhibited up to ten times higher activity after deleting 8-10 amino acid residues from the 
carboxy terminus of this protein. (Dobeli et al., J. Biotechnology 7: 199-216 (1988).) 

Moreover, ample evidence demonstrates that variants often retain a biological activity 
similar to that of the naturally occurring protein. For example, Gayle and coworkers (J. Biol. 
Chem 268:22105-221 1 1 (1993)) conducted extensive mutational analysis of human cytokine 

30 IL-la. They used random mutagenesis to generate over 3,500 individual IL-la mutants that 
averaged 2.5 amino acid changes per variant over the entire length of the molecule. Multiple 
mutations were examined at every possible amino acid position. The investigators found that 
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"[m]osl of the molecule could be altered with little effect on either [binding or biological 
activity]." (See, Abstract.) In fact, only 23 unique amino acid sequences, out of more than 
3,500 nucleotide sequences examined, produced a protein that significantly differed in 
activity from wild-type. 

5 Furthermore, as discussed herein, even if deleting one or more amino acids from the 

N-terminus or C-terminus of a polypeptide results in modification or loss of one or more 
biological functions, other biological activities may still be retained. For example, the ability 
of a deletion variant to induce and/or to bind antibodies which recognize the secreted form 
will likely be retained when less than the majority of the residues of the secreted form are 

10 removed from the N-terminus or C-terminus. Whether a particular polypeptide lacking N- or 
C-termjnal residues of a protein retains such immunogenic activities can readily be 
determined by routine methods described herein and otherwise known in the art. 

Thus, the invention further includes polypeptide variants which show a functional 
activity (e.g., biological activity) of the polypeptide of the invention of which they are a 

15 variant. Such variants include deletions, insertions, inversions, repeats, and substitutions 
selected according to general rules known in the art so as have little effect on activity. 

The present application is directed to nucleic acid molecules at least 80%, 85%, 90%, 
95%, 96%, 97%, 98%, 99% or 100% identical to the nucleic acid sequences disclosed herein 
or fragments thereof, (e.g., including but not limited to fragments encoding a polypeptide 

20 having the amino acid sequence of an N and/or C terminal deletion), irrespective of whether 
they encode a polypeptide having functional activity. This is because even where a particular 
nucleic acid molecule does not encode a polypeptide having functional activity, one of skill 
in the art would still know how to use the nucleic acid molecule, for instance, as a 
hybridization probe or a polymerase chain reaction (PGR) primer. Uses of the nucleic acid 

25 molecules of the present invention that do not encode a polypeptide having functional activity 
include, inter alia, (1) isolating a gene or allelic or splice variants thereof in a cDNA library; 
(2) in situ hybridization (e.g., "FISH") to metaphase chromosomal spreads to provide precise 
chromosomal location of the gene, as described in Verma et al., Human Chromosomes: A 
Manual of Basic Techniques, Pergamon Press, New York (1988); and (3) Northern Blot 

30 analysis for detecting mRNA expression in specific tissues. 

Preferred, however, are nucleic acid molecules having sequences at least 80%, 85%, 
90%, 95%, 96%, 97%>, 98%, 99% or 100% identical to the nucleic acid sequences disclosed 
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herein, which do, in fact, encode a polypeptide having a functional activity of a polypeptide 
of the invention. 

Of course, due to the degeneracy of the genetic code, one of ordinary skill in the art 
will immediately recognize that a large number of the nucleic acid molecules having a 
5 sequence at least 80%, 85%, 90%, 95%, 96%, 97%, 98%, 99%, or 100% identical to, for 
example, the nucleic acid sequence of the cDNA in the related cDNA clone contained in a 
deposited library, the nucleic acid sequence referred to in Table 1 (SEQ ID NO:X), or 
fragments thereof, will encode polypeptides "having functional activity." In fact, since 
degenerate variants of any of these nucleotide sequences all encode the same polypeptide, in 

10 many instances, this will be clear to the skilled artisan even without performing the above 
described comparison assay. It will be further recognized in the art that, for such nucleic acid 
molecules that are not degenerate variants, a reasonable number will also encode a 
polypeptide having functional activity. This is because the skilled artisan is fully aware of 
amino acid substitutions that are either less likely or not likely to significantly effect protein 

15 function (e.g., replacing one aliphatic amino acid with a second aliphatic amino acid), as 
further described below. 

For example, guidance concerning how to make phenotypically silent amino acid 
substitutions is provided in Bowie et al., "Deciphering the Message in Protein Sequences: 
Tolerance to Amino Acid Substitutions," Science 247:1306-1310 (1990), wherein the authors 

20 indicate that there are two main strategies for studying the tolerance of an amino acid 
sequence to change. 

The first strategy exploits the tolerance of amino acid substitutions by natural 
selection during the process of evolution. By comparing amino acid sequences in different 
species, conserved amino acids can be identified. These conserved amino acids are likely 

25 important for protein function. In contrast, the amino acid positions where substitutions have 
been tolerated by natural selection indicates that these positions are not critical for protein 
function. Thus, positions tolerating amino acid substitution could be modified while still 
; maintaining biological activity of the protein. 

The second strategy uses genetic engineering to introduce amino acid changes at 

30 specific positions of a cloned gene to identify regions critical for protein function. For 
example, site directed mutagenesis or alanine-scanning mutagenesis (introduction of single 
alanine mutations at every residue in the molecule) can be used. (Cunningham and Wells, 
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Science 244:1081-1085 (1989).) The resulting mutant molecules can then be tested for 
biological activity. 

As the authors state, these two strategies have revealed that proteins are surprisingly 
tolerant of amino acid substitutions. The authors further indicate which amino acid changes 
5 are likely to be permissive at certain amino acid positions in the protein. For example, most 
buried (within the tertiar}' structure of the protein) amino acid residues require nonpolar side 
chains, whereas few features of surface side chains are generally conserved. Moreover, 
tolerated conservative amino acid substitutions involve replacement of the aliphatic or 
hydrophobic amino acids Ala, Val, Leu and He; replacement of the hydroxyl residues Ser and 

10 Thr; replacement of the acidic residues Asp and Glu; replacement of the amide residues Asn 
and Gin, replacement of the basic residues Lys, Arg, and His; replacement of the aromatic 
residues Phe, Tyr, and Trp, and replacement of the small-sized amino acids Ala, Ser, Thr, 
Met, and Gly. Besides conservative amino acid substitution, variants of the present invention 
include (i) substitutions with one or more of the non-conserved amino acid residues, where 

15 the substituted amino acid residues may or may not be one encoded by the genetic code, or 
(ii) substitution with one or more of amino acid residues having a substituent group, or (iii) 
fusion of the mature polypeptide with another compound, such as a compound to increase the 
stability and/or solubility of the polypeptide (for example, polyethylene glycol), or (iv) fusion 
of the polypeptide with additional amino acids, such as, for example, an IgG Fc fusion region 

20 peptide, or leader or secretory sequence, or a sequence facilitating purification. Such variant 
polypeptides are deemed to be within the scope of those skilled in the art from the teachings 
herein. 

For example, polypeptide variants containing amino acid substitutions of charged 
amino acids with other charged or neutral amino acids may produce proteins with improved 

25 characteristics, such as less aggregation. Aggregation of pharmaceutical formulations both 
reduces activity and increases clearance due to the aggregate's immunogenic activity. 
(Pinckard et aL, Clin, Exp. Immunol. 2:331-340 (1967); Robbins et al.. Diabetes 36: 838-845 
(1987); Cleland et al.. Grit. Rev. Therapeutic Drug Carrier Systems 10:307-377 (1993).) 

A further embodiment of the invention relates to a polypeptide which comprises the 

30 amino acid sequence of a polypeptide having an amino acid sequence which contains at least 
one amino acid substitution, but not more than 50 amino acid substitutions, even more 
preferably, not more than 40 amino acid substitutions, still more preferably, not more than 30 
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amino acid substitutions, and still even more preferably, not more than 20 amino acid 
substitutions. Of course it is highly preferable for a polypeptide to have an amino acid 
sequence which comprises the amino acid sequence of a polypeptide of SEQ ID NO:Y, an 
amino acid sequence encoded by SEQ ID NO:X, and/or the amino acid sequence encoded by 
5 the cDNA in the related cDNA clone contained in a deposited library which contains, in order 
of ever-increasing preference, at least one, but not more than 10, 9, 8, 7, 6, 5, 4, 3, 2 or 1 
amino acid substitutions. In specific embodiments, the number of additions, substitutions, 
and/or deletions in the amino acid sequence of SEQ ID NO:Y or fragments thereof (e.g., the 
mature form and/or other fragments described herein), an amino acid sequence encoded by 
10 SEQ ID NO:X or fragments thereof, and/or the amino acid sequence encoded by the cDNA in 
the related cDNA clone contained in a deposited library or fragments thereof, is 1-5, 5-10, 5- 
25, 5-50, 10-50 or 50-150, conservative amino acid substitutions are preferable. 

Polynucleotide and Polypeptide Fragments 

15 The present invention is also directed to polynucleotide fragments of the breast, 

ovarian, breast cancer and/or ovarian cancer polynucleotides (nucleic acids) of the invention. 
In the present invention, a "polynucleotide fragment" refers, for example, to a polynucleotide 
having a nucleic acid sequence which: is a portion of the cDNA contained in a depostied 
cDNA clone; or is a portion of a polynucleotide sequence encoding the polypeptide encoded 

20 by the cDNA contained in a deposited cDNA clone; or is a portion of the polynucleotide 
sequence in SEQ ID NO:X or the complementary strand thereto; or is a polynucleotide 
sequence encoding a portion of the polypeptide of SEQ ID NO:Y; or is a polynucleotide 
sequence encoding a portion of a polypeptide encoded by SEQ ID NO:X or the 
complementary strand thereto. The nucleotide fragments of the invention are preferably at 

25 least about 15 nt, and more preferably at least about 20 nt, still more preferably at least about 
30 nt, and even more preferably, at least about 40 nt, at least about 50 nt, at least about 75 nt, 
at least about 100 nt, at least about 125 nt or at least about 150 nt in length. A fragment "at 
least 20 nt in length," for example, is intended to include 20 or more contiguous bases from, 
for example, the sequence contained in the cDNA in a related cDNA clone contained in a 

30 deposited library, the nucleotide sequence shown in SEQ ID NO:X or the complementary 
stand thereto. In this context "about" includes the particularly recited value or a value larger 
or smaller by several (5, 4, 3, 2, or I) nucleotides. These nucleotide fragments have uses that 
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include, but are not limited to, as diagnostic probes and primers as discussed herein. Of 
course, larger fragments (e.g., at least 150, 175, 200, 250, 500, 600, 1000, or 2000 
nucleotides in length) are also encompassed by the invention. 

Moreover, representative examples of polynucleotide fragments of the invention, 
5 include, for example, fragments comprising, or alternatively consisting of, a sequence from 
about nucleotide number 1-50, 51-100, 101-150, 151-200, 201-250, 251-300, 301-350, 351- 
400, 401-450, 451-500, 501-550, 551-600, 651-700,701- 750, 751-800, 800-850, 851-900, 
901-950,951-1000, 1001-1050, 1051-1100, 1101-1150, 1151-1200, 1201-1250, 1251-1300, 
130M350, 1351-1400, 1401-1450, 1451-1500, 1501-1550, 1551-1600, 1601-1650, 1651- 

10 1700, 1701-1750, 1751-1800, 1801-1850, 1851-1900, 1901-1950, 1951-2000,2001-2050, 
2051-2100, 2101-2150, 2151-2200, 2201-2250, 2251-2300, 2301-2350, 2351-2400, 2401- 
2450, 2451-2500, 2501-2550, 2551-2600, 2601-2650, 2651-2700, 2701-2750, 2751-2800, 
2801-2850, 2851-2900, 2901-2950, 2951-3000, 3001-3050, 3051-3100, 3101-3150, 3151- 
3200, 3201-3250, 3251-3300, 3301-3350, 3351-3400, 3401-3450, 3451-3500, 3501-3550, 

15 3551-3600, 3601-3650, 3651-3700, 3701-3750, 3751-3800, 3801-3850, 3851-3900, 3901- 
3950, 3951-4000, 4001-4050, 4051-4100, 4101-4150, 4151-4200, 4201-4250, 4251-4300, 
4301-4350, 4351-4400, 4401-4450, 4451-4500, 4501-4550, 4551-4600, 4601-4650, 4651- 
4700, 4701-4750, 4751-4800, 4801-4850, 4851-4900, 4901-4950, 4951-5000, 5001-5050, 
5051-5100, 5101-5150, 5151-5200, 5201-5250, 5251-5300, 5301-5350, 5351-5400, 5401- 

20 5450, 5451-5500, 5501-5550, 5551-5600, 5601-5650, 5651-5700, 5701-5750, 5751-5800, 
5801-5850, 5851-5900, 5901-5950, 5951-6000, 6001-6050, 6051-6100,6101-6150, and 6151 
to the end of SEQ ID NO:X, or the complementary strand thereto. In this context "about" 
includes the particularly recited range or a range larger or smaller by several (5, 4, 3, 2, or 1) 
nucleotides, at either terminus or at both termini. Preferably, these fragments encode a 

25 polypeptide which has a functional activity (e.g., biological activity) of the polypeptide 
encoded by the polynucleotide of which the sequence is a portion. More preferably, these 
fragments can be used as probes or primers as discussed herein. Polynucleotides which 
hybridize to one or more of these nucleic acid molecules under stringent hybridization 
conditions or alternatively, under lower stringency conditions, are also encompassed by the 

30 invention, as are polypeptides encoded by these polynucleotides or fragments. 

Moreover, representative examples of polynucleotide fragments of the invention, 
include, for example, fragments comprising, or alternatively consisting of, a sequence from 
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about nucleotide number 1-50, 51-100, 101-150, 151-200, 201-250, 251-300, 301-350, 351- 
400, 401-450, 451-500, 501-550, 551-600, 651-700,701- 750, 751-800, 800-850, 851-900, 
901-950, 951-1000, 1001-1050, 1051-1100, 1101-1150, 1151-1200, 1201-1250, 1251-1300, 
1301-1350, 1351-1400, 1401-1450, 1451-1500, 1501-1550, 1551-1600, 1601-1650, 1651- 
5 1700, 1701-1750, 1751-1800, 1801-1850, 1851-1900, 1901-1950, 1951-2000,2001-2050, 
2051-2100, 2101-2150, 2151-2200, 2201-2250, 2251-2300, 2301-2350, 2351-2400, 2401- 
2450, 2451-2500, 2501-2550, 2551-2600, 2601-2650, 2651-2700, 2701-2750, 2751-2800, 
2801-2850, 2851-2900, 2901-2950, 2951-3000, 3001-3050, 3051-3100, 3101-3150, 3151- 
3200, 3201-3250, 3251-3300, 3301-3350, 3351-3400, 3401-3450, 3451-3500, 3501-3550, 

10 3551-3600, 3601-3650, 3651-3700, 3701-3750, 3751-3800, 3801-3850, 3851-3900, 3901- 
3950, 3951-4000, 4001-4050, 4051-4100, 4101-4150, 4151-4200, 4201-4250, 4251-4300, 
4301-4350, 4351-4400, 4401-4450, 4451-4500, 4501-4550, 4551-4600, 4601-4650, 4651- 
4700, 4701-4750, 4751-4800, 4801-4850, 4851-4900, 4901-4950, 4951-5000, 5001-5050, 
5051-5100, 5101-5150, 5151-5200, 5201-5250, 5251-5300, 5301-5350, 5351-5400, 5401- 

15 5450, 5451-5500, 5501-5550, 5551-5600, 5601-5650, 5651-5700, 5701-5750, 5751-5800, 
5801-5850, 5851-5900, 5901-5950, 5951-6000, 6001-6050, 6051-6100, 6101-6150, and 6151 
to the end of the cDNA nucleotide sequence contained in the deposited cDNA clone, or the 
complementary strand thereto. In this context "about" includes the particularly recited range, 
or a range larger or smaller by several (5, 4, 3, 2, or 1) nucleotides, at either terminus or at 

20 both termini. Preferably, these fragments encode a polypeptide which has a functional 
activity (e.g., biological activity) of the polypeptide encoded by the cDNA nucleotide 
sequence contained in the deposited cDNA clone. More preferably, these fragments can be 
used as probes or primers as discussed herein. Polynucleotides which hybridize to one or 
more of these fragments under stringent hybridization conditions or alternatively, under lower 

25 stringency conditions, are also encompassed by the invention, as are polypeptides encoded by 
these polynucleotides or fragments. 

In the present invention, a "polypeptide fragment" refers to an amino acid sequence 
which is a portion of that contained in SEQ ID NO:Y, a portion of an amino acid sequence 
encoded by the polynucleotide sequence of SEQ ID NO:X, and/or encoded by the cDNA 

30 contained in the related cDNA clone contained in a deposited library. Protein (polypeptide) 
fragments may be "free-standing," or comprised within a larger polypeptide of which the 
fragment forms a part or region, most preferably as a single continuous region. 
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Representative examples of polypeptide fragments of the invention, include, for example, 
fragments comprising, or alternatively consisting of, an amino acid sequence from about 
amino acid number 1-20,21-40.41-60,61-80,81-100, 102-120, 121-140, 14M60, 161-180, 
181-200, 201-220, 221-240, 241-260, 261-280, 281-300, 301-320, 321-340, 341-360, 361- 
5 380, 381-400, 401-420, 421-440, 441-460, 461-480, 481-500, 501-520, 521-540, 541-560, 
561-580, 581-600, 601-620, 621-640, 641-660, 661-680, 681-700, 701-720, 721-740, 741- 
760, 761-780, 781-800, 801-820, 821-840, 841-860, 861-880, 881-900, 901-920, 921-940, 
941-960, 961-980, 981-1000, 1001-1020, 1021-1040, 1041-1060, 1061-1080, 1081-1100, 
1101-1120, 1121-1140, 1141-1160, 1161-1180, 1181-1200, 1201-1220, 1221-1240, 1241- 

10 1260, 1261-1280, 1281-1300, 1301-1320, 1321-1340, 1341-1360, 1361-1380, 1381-1400, 
1401-1420, 1421-1440, 1441-1460, 1461-1480, 1481-1500, 1501-1520, 1521-1540, 1541- 
1560, 1561-1580, 1581-1600, 1601-1620, 1621-1640, 1641-1660, 1661-1680, 1681-1700, 
1701-1720, 1721-1740, 1741-1760, 1761-1780, 1781-1800, 1801-1820, 1821-1840, 1841- 
1860, 1861-1880, 1881-1900, 1901-1920, 1921-1940, 1941-1960, 1961-1980, and 1981 to 

15 the end of SEQ ID NO:Y. Moreover, polypeptide fragments of the invention may be at least 
about 10, 15, 20, 25, 30, 35, 40, 45, 50, 55, 60, 65, 70, 75, 80, 85, 90, 100, 110, 120, 130, 
140, or 150 amino acids in length. In this context "about" includes the particularly recited 
ranges or values, or ranges or values larger or smaller by several (5, 4, 3, 2, or 1) amino acids, 
at either terminus or at both termini. Polynucleotides encoding these polypeptide fragments 

20 are also encompassed by the invention. 

Even if deletion of one or more amino acids from the N-terminus of a protein results 
in modification of loss of one or more biological functions of the protein, other functional 
activities (e.g., biological activities, ability to multimerize, ability to bind a ligand) may still 
be retained. For example, the ability of shortened muteins to induce and/or bind to antibodies 

25 which recognize the complete or mature forms of the polypeptides generally will be retained 
when less than the majority of the residues of the complete or mature polypeptide are 
removed from the N-terminus. Whether a particular polypeptide lacking N-terminal residues 
of a complete polypeptide retains such immunologic activities can readily be determined by 
routine methods described herein and otherwise known in the art. It is not unlikely that a 

30 mutein with a large number of deleted N-terminal amino acid residues may retain some 
biological or immunogenic activities. In fact, peptides composed of as few as six amino acid 
residues may often evoke an immune response. 
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Accordingly, polypeptide fragments of the invention include the secreted protein as 
well as the mature form. Further preferred polypeptide fragments include the secreted protein 
or the mature form having a continuous series of deleted residues from the amino or the 
carboxy terminus, or both. For example, any number of amino acids, ranging from 1-60, can 
5 be deleted from the amino terminus of either the secreted polypeptide or the mature form. 
Similarly, any number of amino acids, ranging from 1-30, can be deleted from the carboxy 
terminus of the secreted protein or mature form. Furthermore, any combination of the above 
amino and carboxy terminus deletions are preferred. Similarly, polynucleotides encoding 
these polypeptide fragments are also preferred. 

10 The present invention further provides polypeptides having one or more residues 

deleted from the amino terminus of the amino acid sequence of a polypeptide disclosed 
herein (e.g., a polypeptide of SEQ ID NO:Y, a polypeptide encoded by the polynucleotide 
sequence contained in SEQ ID NO:X, and/or a polypeptide encoded by the cDNA contained 
in the related cDNA clone contained in a deposited library). In particular, N-terminal 

15 deletions may be described by the general formula m-q, where q is a whole integer 
representing the total number of amino acid residues in a polypeptide of the invention (e.g., 
the polypeptide disclosed in SEQ ID NO: Y), and m is defined as any integer ranging from 2 
to q-6. Polynucleotides encoding these polypeptides are also encompassed by the invention. 
Also as mentioned above, even if deletion of one or more amino acids from the 

20 C-terminus of a protein results in modification of loss of one or more biological functions of 
the protein, other functional activities (e.g., biological activities, ability to multimerize, 
ability to bind a ligand) may still be retained. For example the ability of the shortened mutein 
to induce and/or bind to antibodies which recognize the complete or mature forms of the 
polypeptide generally will be retained when less than the majority of the residues of the 

25 complete or mature polypeptide are removed from the C-terminus. Whether a particular 
polypeptide lacking C-terminal residues of a complete polypeptide retains such immunologic 
activities can readily be determined by routine methods described herein and otherwise 
known in the art. It is not unlikely that a mutein with a large number of deleted C-terminal 
amino acid residues may retain some biological or immunogenic activities. In fact, peptides 

30 composed of as few as six amino acid residues may often evoke an immune response. 

Accordingly, the present invention further provides polypeptides having one or more 
residues from the carboxy terminus of the amino acid sequence of a polypeptide disclosed 
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herein (e.g., a polypeptide of SEQ ID NO:Y, a polypeptide encoded by the polynucleotide 
sequence contained in SEQ ID NO;X, and/or a polypeptide encoded by the cDNA contained 
in deposited cDNA clone referenced in Table 1). In particular, C-terminal deletions may be 
described by the general formula 1-n, where n is any whole integer ranging from 6 to q-1, and 
5 where n corresponds to the position of an amino acid residue in a polypeptide of the 
invention. Polynucleotides encoding these polypeptides are also encompassed by the 
invention. 

In addition, any of the above described N- or C-terminal deletions can be combined to 
produce a N- and C-terminal deleted polypeptide. The invention also provides polypeptides 

10 having one or more amino acids deleted from both the amino and the carboxyl termini, which 
may be described generally as having residues m-n of a polypeptide encoded by SEQ ID 
NO:X (e.g., including, but not limited to, the preferred polypeptide disclosed as SEQ ID 
NO:Y), and/or the cDNA in the related cDNA clone contained in a deposited library, where n 
and m are integers as described above. Polynucleotides encoding these polypeptides are also 

15 encompassed by the invention. 

Any polypeptide sequence contained in the polypeptide of SEQ ID NO:Y, encoded by 
the polynucleotide sequences set forth as SEQ ID NO:X, or encoded by the cDNA in the 
related cDNA clone contained in a deposited library may be analyzed to determine certain 
preferred regions of the polypeptide. For example, the amino acid sequence of a polypeptide 

20 encoded by a polynucleotide sequence of SEQ ID NO:X, or the cDNA in a deposited cDNA 
clone may be analyzed using the default parameters of the DNASTAR computer algorithm 
(DNASTAR, Inc., 1228 S. Park St., Madison, WI 53715 USA; http://www.dnastar.com/). 

Polypeptide regions that may be routinely obtained using the DNASTAR computer 
algorithm include, but are not limited to, Garnier-Robson alpha-regions, beta-regions, 

25 turn-regions, and coil-regions, Chou-Fasman alpha-regions, beta-regions, and turn-regions, 
Kyte-Doolittle hydrophilic regions and hydrophobic regions, Eisenberg alpha- and 
beta-amphipathic regions, Karplus-Schulz flexible regions, Emini surface-forming regions 
and Jameson-Wolf regions of high antigenic index. Among highly preferred polynucleotides 
of the invention in this regard are those that encode polypeptides comprising regions that 

30 combine several structural features, such as several (e.g., 1 , 2, 3 or 4) of the features set out 
above. 
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Additionally, Kyte-Doolittle hydrophilic regions and hydrophobic regions, Emini 
surface-forming regions, and Jameson-Wolf regions of high antigenic index (i.e., containing 
four or more contiguous amino acids having an antigenic index of greater than or equal to 
1 .5, as identified using the default parameters of the Jameson- Wolf program) can routinely be 
5 used to determine polypeptide regions that exhibit a high degree of potential for antigenicity. 
Regions of high antigenicity are determined from data by DNASTAR analysis by choosing 
values which represent regions of the polypeptide w^hich are likely to be exposed on the 
surface of the polypeptide in an environment in which antigen recognition may occur in the 
process of initiation of an immune response. 

10 Preferred polypeptide fragments of the invention are fragments comprising, or 

alternatively consisting of, an amino acid sequence that displays a functional activity of the 
polypeptide sequence of which the amino acid sequence is a fragment. 

By a polypeptide demonstrating a "functional activity" is meant, a polypeptide 
capable of displaying one or more known functional activities associated with a full-length 

15 (complete) protein of the invention. Such functional activities include, but are not limited to, 
biological activity, antigenicity [ability to bind (or compete with a polypeptide for binding) 
to an anti-polypeptide antibody], immunogenicity (ability to generate antibody which binds to 
a specific polypeptide of the invention), ability to form multimers with polypeptides of the 
invention, and ability to bind to a receptor or ligand for a polypeptide. 

20 Other preferred polypeptide fragments are biologically active fragments. Biologically 

active fragments are those exhibiting activity similar, but not necessarily identical, to an 
activity of the polypeptide of the present invention. The biological activity of the fragments 
may include an improved desired activity, or a decreased undesirable activity. 

In preferred embodiments, polypeptides of the invention comprise, or alternatively 

25 consist of, one, two, three, four, five or more of the antigenic fragments of the polypeptide of 
SEQ ID NO:Y, or portions thereof Polynucleotides encoding these polypeptides are also 
encompassed by the invention. 
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Table 4 



Sequence/ 
Contig ID 


Epitope 


508678 


Preferred epitopes include those comprising a sequence shown in SEQ ID NO. 422 as 
residues: GIn-21 loAra-43. 


508968 


Preferred epitopes include those comprising a sequence shown in SEQ ID NO. 423 as 
residues; Thr-1 to Lys-6. 


509029 


Preferred epitopes include those comprising a sequence shown in SEQ ID NO. 424 as 
residues: Asp-l to Trp-8, Thr-!2 toCys-l9. Pro-4l to Leu-51. 


522632 


Preferred epitopes include those comprising a sequence shown in SEQ ID NO. 426 as 
residues: Cys-69 to Asn-74. Lys-83 to Glv-89. 


524655 


Preferred epitopes include those comprising a sequence shown in SEQ ID NO. 427 as 
residues: Tyr-28 to Asn-35. lle-45 to Lvs-55. 


525847 


Preferred epitopes include those comprising a sequence shown in SEQ ID NO. 428 as 
residues: Lys-27 to Asp-33. 


530306 


Preferred epitopes include those comprising a sequence shown in SEQ ID NO. 429 as 
residues: Ara-l to Arg-l !, Tyr-21 to His-27. 


532818 


Preferred epitopes include those comprising a sequence shown in SEQ ID NO. 430 as 
residues: Pro- 1 0 to Thr-2 1 . Asp-32 to Thr-38. Glv-47 to Glu-60. 


533385 


Preferred epitopes include those comprising a sequence shown in SEQ ID NO. 43 1 as 
residues: Asn.l7 to Trp-22. Pro-34 to Glu-49. His-61 to Ser-7I. 


533532 


Preferred epitopes include those comprising a sequence shown in SEQ ID NO. 432 as 
residues: Glu-29 to Lys-37, Lys-110 to lIe-1 18, Arg-126 loCys-135, Lys-157 toGly- 
163. Gin- 1 88 to Trp-201. Glu-269 to Thr-278. 


534852 


Preferred epitopes include those comprising a sequence shown in SEQ ID NO. 433 as 
residues: Gln-1 to Ser-14. Thr-23 to Val-3 K Cys-43 to Ala-56, Glu-58 to Ser-96, Gly- 
101 to Tyr-109, Asn-143 to Tyr-148, Pro- 1 54 to His- 1 64, Ser-195 to Asn-201, Pro-264 to 
Pro-271. 


537910 


Preferred epitopes include those comprising a sequence shown in SEQ ID NO. 434 as 
residues: Pro-4 to AIa-1 1, Pro- 1 10 to Are- 122. 


539577 


Preferred epitopes include those comprising a sequence shown in SEQ ID NO. 436 as 
residues: Pro-9 to Gin- 1 9. 


548595 


Preferred epitopes include those comprising a sequence shown in SEQ ID NO. 439 as 
residues: Asp-27 to Asp-33, His-54 to Tyr-59. Ile-91 to Pro-96. 


549337 


Preferred epitopes include those comprising a sequence shown in SEQ ID NO. 440 as 
residues: Pro-38 to Asp-43, Arg-l 55 to Phe-162, Pro- 164 to Asp- 1 70, Pro- 172 to Gly- 
182. 


553091 


Preferred epitopes include those comprising a sequence shown in SEQ ID NO. 442 as 
residues: Lys-55'to Lys-62, GIn-67 to Val-76, Lys-101 to Glu-1 1 1, Lys-125 to Arg-140, 
Arg-l 61 to Arg-l 66, Gln-1 71 to Asp- 187. 


553827 


Preferred epitopes include those comprising a sequence shown in SEQ ID NO. 443 as 
residues: Glu-17 to Pro-22. Pro-70 to His-76, Thr-84 to Arg-92, Asp-109 to Tyr-1 17, 


556350 


Preferred epitopes include those comprising a sequence shown in SEQ ID NO. 444 as 
residues: Glu-I to Ser-15, Phe-17 to Pro-22, Lvs-l 16 to Arfi-131. 


556351 


Preferred epitopes include those comprising a sequence shown in SEQ ID NO. 445 as 
residues: Gln-9 to Phe-23, Cys-53 to Ser-64, Glu-86 to Asp-93, lie- 1 00 to Glu-1 1 2, Tyr- 
124 to Glu-133, Ser-197 to Ser-204, Asn-208 to Glu-214, Lys-228 to Lys-233. Tyr-248 
to Lys-259, Pro-330 to Ala-335, Gln-349 to Lvs-355, Ala-365 to Glu-374, Ser-376 to 
Ser-397. 


557007 


Preferred epitopes include those comprising a sequence shown in SEQ ID NO. 446 as 
residues: Pro-46 to Tyr-54, Pro-8i to GIy-87, Pro-97 to Gly-104, Leu- 106 to Asn-1 16, 
Asn-129 to Phe-134, Lys-147 to Tyr-1 58, Ala- 192 to Ser-199, Asp-204 to Glu-215. Gly- 
221 to Ser-232. 


558456 


Preferred epitopes include those comprising a sequence shown in SEQ ID NO. 448 as 
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residues; Glu-19 to Tvr-24. Ser-60 to Thr-65, Tlir-82 to Pro-88. 


558708 


Preferred epitopes include those comprising a sequence shown in SEQ ID NO, 449 as 
residues: Ar&-13 to Ala-20. Pro-27 to Art5-32, Lvs-37 to Glu-62. 


574789 


Preferred epitopes include those comprising a sequence shown in SEQ ID NO. 450 as 
residues: GIv- 1 6 to Lvs-2 1 . 


578203 


Preferred epitopes include those comprising a sequence shown in SEQ ID NO. 451 as 
residues: Thr-7 to Arg-18. 


588869 


Preferred epitopes include those comprising a sequence shown in SEQ ID NO. 453 as 
residues: Pro-14 to Ser-19, Glu-55 to Phc-60, Asp-93 to Ser-98, Thr-138 to Tyr-144, 
Asn-155 to Phe.163, Arc- 168 to Ser-175, Gln-205 to Lys-210, Phe.226 to Thr-233. 


597076 


Preferred epitopes include those comprising a sequence shown in SEQ ID NO. 454 as 
residues: Ser-50 to Gln-56. 


598656 


Preferred epitopes include those comprising a sequence shown in SEQ ID NO. 455 as 
residues: Ser-85 to Tyr-92, Ara-109 to Lys-l 14. 


614329 


Preferred epitopes include those comprising a sequence shown in SEQ ID NO. 457 as 
residues: Ars-59 to AUv67. Asn-78 to Arg-85. 


620956 


Preferred epitopes include those comprising a sequence shown in SEQ iD NO. 459 as 
residues: Ala-1 1 to Gln-l 6. 


621889 


Preferred epitopes include those comprising a sequence shown in SEQ ID NO. 460 as 
residues: Scr-84 to Giv-99, Pro- 101 to Ser-1 12. 


651784 


Preferred epitopes include those comprising a sequence shown in SEQ ID NO. 462 as 
residues: Gly-29 to Glv-35, Ala-37 to Ala-48. 


651826 


Preferred epitopes include those comprising a sequence shown in SEQ ID NO, 463 as 
residues: Arg-i to Ser-16, Gln-49 to Lys-60, Glu-77 to Leu-83, Gln-91 to Arg-lOO, Phe- 
140 to Ala-154. Asp-214 to Leu-219, Ala-258 to Met-275, lle-289 to Lys-295, Ala-314 
to GIu-320, Are-327 to Met-332. Thr-383 to Ser-388, Ser-425 to Asp-433. 


653282 


Preferred epitopes include those comprising a sequence shown in SEQ ID NO. 464 as 
residues: Are- 12 to lie- 19. GIu-23 to Pro-29, Pro-37 to Val-45. 


657122 


Preferred epitopes include those comprising a sequence shown in SEQ ID NO. 465 as 
residues: Ala-6 to GIy-13, Arg-41 to Thr-47. 


•661442 


Preferred epitopes include those comprising a sequence shown in SEQ ID NO. 466 as 
residues: Arg-6 to Ser-Il, Asp-53 to Ser-59, Ala-88 to Ala-104, Thr-l 14 to Asn-121, 
GIu-128 to Val-137, Asn-144 to Thr-150, Ser-174 to Asn-180, Gly-203 to Asp-212. 


664914 


Preferred epitopes include those comprising a sequence shown in SEQ ID NO. 467 as 
residues: Pro- 1 2 to Lys- 1 7. 


666654 


Preferred epitopes include those comprising a sequence shown in SEQ ID NO. 468 as 
residues: Thr-5 to Leu- 10, Pro- 13 to Leu-24. 


667084 


Preferred epitopes include those comprising a sequence shown in SEQ ID NO. 469 as 
residues: Pro-1 to Pro-9, Gly-50 to Ser.55, Gly-80 to Ser-85, Gly-91 to Tyr-96, Arg-144 
to Gin- 160, Asp- 1 95 to Thr-202, Lys-246 to Glu-252, Met-283 to Glu-288, Glu-292 to 
Glu-299, Ser-304 to Asn-310, Ala-356 to Tyr-362, Met-387 to Tyr-394, Gln-424 to Thr- 
431,Ser-450 to Arg-459. 


667380 


Preferred epitopes include those comprising a sequence shown in SEQ ID NO, 470 as 
Residues: Pro-l to Pro-6, Thr-l 34 to Gin- 140, Tyr-142 to Arg-lSO. 


671315 


Preferred epitopes include those comprising a sequence shown in SEQ ID NO. 472 as 
residues: Ala- 16 to Gly-21, Glu-28 to Gly-35. 


671993 


Preferred epitopes include those comprising a sequence shown in SEQ ID NO. 473 as 
residues: Pro-8 to Ser-23. 


674618 


Preferred epitopes include those comprising a sequence shown in SEQ ID NO. 474 as 
residues: Ile-3 to Ser- 1 1, Arp-24 to Glu-30. 


675027 


Preferred epitopes include those comprising a sequence shown in SEQ ID NO. 475 as 
residues: His-47 to Ile-52, Ala-71 to Arg-76, Asp-78 to Lvs-87. 


677202 


Preferred epitopes include tho.se comprising a sequence shown in SEQ ID NO. 476 as 
residues: VaI-45 to Gly-50, Thr-56 to Glu-64. 


678504 


Preferred epitopes include those comprising a sequence shown in SEQ ID NO. 477 as 
residues: Arg- 7 to Ser-19. 
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678985 


Preferred epitopes include those comprising a sequence shown in SEQ ID NO. 478 as 
residues: Lys-17 to Thr-23. Leu-26 to His-36, His-41 to Pro-56. Ala-60 to Gly-71, Lys- 
77 to Ser-91. Asp- 101 to Lys-109. A5p-200 toGly.206. Asp-245 to Leu-253. bln-262 to 
Phe-274. 


682161 


Preferred epitopes include those comprising a sequence shown in SEQ ID NO. 479 as 
residues: Arg-5 to Pro-l 1. Pro-22 to Thr-29, Trp-53 to Arg-62, Pro-69 to Gly-78, Lys-98 
to Tvr.l03. Glu-144 to His-151. Pro-l 72 to Leu- 1 78. Gin- 1 93 to Glu-200. 


683476 


Preferred epitopes include those comprising a sequence shown in SEQ ID NO. 480 as 
residues: Ala-5 to Trp-I9. 


693589 


Preferred epitopes include those comprising a sequence shown in SEQ ID NO. 482 as 
residues: Cys-1 to Arg-13, Pro-15 to GIy-2^1 . Gly-54 to Ser-59, Trp-73 to Lys-78, Ser-90 
to Arg-104. 


694991 


Preferred epitopes include those comprising a sequence shown in SEQ ID NO, 483 as 
residues: Lvs-1 to Thr-6. Pro-8 to Gly-19. Val-61 to Arg-66. 


698669 


Preferred epitopes include those comprising a sequence shown in SEQ ID NO. 485 as 
residues: Pro-3l to His-36. Gly-43 to Tyr-48, Glu-136 to Ser-142, Pro-178 to Arg-183, 
Pro-273 to Asp-278. Gly-3 1 8 to Cys-326. 


707357 


Preferred epitopes include those comprising a sequence shown in SEQ ID NO. 488 as 
residues: Gly-6 to Ara-21. Are-89 to Asp-94. 


707360 


Preferred epitopes include those comprising a sequence shown in SEQ ID NO. 489 as 
residues: Ser- 1 3 to Glu-26, Ser-48 to Val-55, Lys-85 to Thr-9 1 , Asp- 1 1 5 to Trp- 1 20. 


707375 


Preferred epitopes include those comprising a sequence shown in SEQ ID NO. 490 as 
residues: Arg-1 to Gly-6, Ala- 12 to Arg-19, Arg-34 to Arg-40, Arg-47 to Ala-5 8, Ser-67 
to Thr-80, Scr- 1 09 to Ser- 1 1 7. Asn- 1 34^0 Ser- 14 1 , Pro- 1 75 to Arg- 1 8 L Lys-2 1 2 to Thr- 
218, Asp-275 to Cys-285. 


707754 


Preferred epitopes include those comprising a sequence shown in SEQ ID NO. 491 as 
residues: Val-32 to Leu-4L Asn-55 to Arp-63. Pro-104 to Ala-113. 


712248 


Preferred epitopes include those comprising a sequence shown in SEQ ID NO. 493 as 
residues: Ser- 13 to Gly-20, Gln-36 to Ser-4l, Pro-44 to Phe-58. 


715445 


Preferred epitopes include those comprising a sequence shown in SEQ ID NO. 494 as 
residues: Gly-23 to Thr-29, Scr-32 to Vai-40, Lys-181 to Ser-188, Glu-197 to Gln-204, 
Ara-244 to His-249, Ala-253 to Thr-264. 


716362 


Preferred epitopes include those comprising a sequence shown in SEQ ID NO. 495 as 
residues: Cys-I to Gly-8, Ar^-1\ to Ser-77, His-102 to Ser-108. 


716835 


Preferred epitopes include those comprising a sequence shown in SEQ ID NO. 496 as 
residues: Gln-7 to Glu-14, Ala-24 to Are-4L 


717685 


Preferred epitopes include those comprising a sequence shown in SEQ ID NO. 498 as 
residues: Gly-1 to Ala-7, His-70 to Gly-76^ Gln-130 to Thr-135, Thr-182 to Pro-189, 
Asn-259 to Leu-267, Glu-280 to Ala-289, Gln-303 to Asn-310. 


719755 


Preferred epitopes include those comprising a sequence shown in SEQ ID NO. 499 as 
residues: Asp-14 to Pro-25, Pro-59 to Glu-lOO, Cys-126 to Gly-145, Pro-158 to Lys-164, 
Lys-176 to Leu- 197, Leu-221 to Tyr-238. 


720389 


Preferred epitopes include those comprising a sequence shown in SEQ ID NO. 500 as 
residues: Thr-!3 to Ala- 19, AIa-26 to Pro-36, Ser-63 to Gly-68- 


720903 


Preferred epitopes include those comprising a sequence shown in SEQ ID NO. 501 as 
residues: Asn-6 to Ser-I I, Ala-91 to Arg-99, Trp-107 to Tyr-1 13, Tyr-131 to Met-137, 
Asp-150 to Vai-157. 


721562 


Preferred epitopes include those comprising a sequence shown in SEQ ID NO. 503 as 
residues: Asp-39 to lle-45. 


722775 


Preferred epitopes include those comprising a sequence shown in SEQ ID NO. 504 as 
residues: Pro-34 to Ser-4l. Cys-49 to Arg-55, Thr-92 to Ala-98, Thr-160 to Gly-1 73. 
Thr-194 to Pro-200, Gly-274 to Trp-282. Pro-285 to Ala-291. 


724463 


Preferred epitopes include those comprising a sequence shown in SEQ ID NO. 505 as 
residues: Glu-9 to Lys-15, Pro-23 to Tyr-33. 


728418 


Preferred epitopes include those comprising a sequence shown in SEQ ID NO. 507 as 
residues: Ala-6 to Gln-1 1. Ser-25 to Ser-30, Lys-63 to Gly-69, Ser-108 to Asp- 1 18, Arg- 
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1 27 to His- 1 32. Asp- 1 56 to Cvs- 161. 


728920 


Preferred epiiopes include those comprising a sequence shown in SEQ ID NO. 508 as 
residues: Thr-7 to Ala-15. 


732958 


Preferred epitopes include those comprising a sequence shown in SEQ ID NO. 509 as 
residues: Thr-10 to Ala-15. Pro-63 lo Ser-7S. Scr-82 to Leu-94. 


733134 


Preferred epitopes include those comprising a sequence shown in SEQ ID NO. 510 as 
residues: Arg-4 to Gly-24, Lys-47 to Phe-55. Lys-61 to Ala-67, Gly-108 to Thr-1 14, 
Pro- 184 to Pro-191. Pro-292 to Arg-299. Pro-355 toGlu-392. 


734099 


Preferred epitopes include those comprising a sequence shown in SEQ ID NO. 51 1 as 
residues: His- 1 to Arp-?. Gin- 15 to Ala-23, Met-43 toGln-55. 


738911 


Preferred epitopes include those comprising a sequence shown in SEQ ID NO. 515 as 
residues: Are-4 to Asp-10. Ser-64 to His-75. Pro-127 to Asn-136. Phe-143 to Gln-150. 


739226 


Preferred epiiopes include those comprising a sequence shown in SEQ ID NO. 516 as 
residues: Asn-l to Thr-7. 


739527 


Preferred epitopes include those comprising a sequence shown in SEQ ID NO. 517 as 
residues: Glv-I to ArG-9. Val-28 to Glv-39. Asp-52 to Leu-60, Ala- 106 to Trp-1 17. 


744331 


Preferred epitopes include those comprising a sequence shown in SEQ ID NO. 520 as 
residues: Scr-17 to Ar2-24. 


744751 


Preferred epitopes include those comprising a sequence shown in SEQ ID NO. 521 as 
residues: Ser-8 to Val-I3. Pro-34 to Cys-40. Tyr-48 to Ser-55. Glv-63 to Ser-73. 


745750 


Preferred epiiopes include those comprising a sequence shown in SEQ ID NO. 522 as 
residues: Ser-2 to Glu-17. 


746285 


Preferred epiiopes include those comprising a sequence shown in SEQ ID NO. 523 as 
residues: Lys-87 to Lvs-92. 


746416 


Preferred epitopes include those comprising a sequence shown in SEQ ID NO. 524 as 
residues: Are-6 to Leu- 12. Tyr-18 to Asp-25. 


747851 


Preferred epitopes include those comprising a sequence shown in SEQ ID NO. 525 as 
residues: Gly-124 to Ser-129. Leu-162 to Gly-167, Val-272 lo Ala-278, Lys-293 to Asp- 
298. 


751315 


Preferred epitopes include those comprising a sequence shown in SEQ ID NO. 527 as 
residues: Cys-12 to Pro-20. 


754634 


Preferred epitopes include those comprising a sequence shown in SEQ ID NO. 529 as 
residues: Asp- 1 to Thr-10. 


756833 


Preferred epitopes include those comprising a sequence shown in SEQ ID NO. 53 1 as 
residues: Thr-36 to Pro-49, Glu-52 to Pro-67. 


756878 


Preferred epitopes include those comprising a sequence shown in SEQ ID NO. 532 as 
residues: Pro-8 to Lvs- 1 5. Glv-69 to Trp-75. 


757332 


Preferred epitopes include those comprising a sequence shown in SEQ ID NO. 533 as 
residues: GIn-23 to Val-3 1, Phe-39 to lle-52. 


760835 


Preferred epitopes include those comprising a sequence shown in SEQ ID NO. 534 as 
residues: Phe-1 to Lys-7. Cys-82 to Ser-90. 


761760 


Preferred epitopes include those comprising a sequence shown in SEQ ID NO. 535 as 
residues: Arg-34 to Pro-39, Gly-43 to Asp-51, Gln-147 to Arfi-153. 


762520 


Preferred epitopes include those comprising a sequence shown in SEQ ID NO. 536 as 
residues: His-6 to His- 1 1, Ala- 13 to Glu-18, AIa-60 to Ser-65, Ile-72 to Ser-77, GIn-95 
0 Phe-1 01, Leu- 136 to Ser-142. 


764461 

I 


Preferred epitopes include those comprising a sequence shown in SEQ ID NO. 537 as 
-esidues: Val-15 to Ala.22, Val-26 to Glv-38. 


764517 J 
I 


Preferred epiiopes include those comprising a sequence shown in SEQ ID NO. 538 as 
esidues: Gly-30 to Lys-36, Gly-94 to Ala-IOO, Gln-150 to Gly-156, GIn-189 to Leu- 
195. 


765132 I 
r 


=*refcrred epitopes include tho.se comprising a sequence shown in SEQ ID NO. 539 as 
■esidues: Asn-80 to Thr-87, Ser-165 to Lcu-182, Thr-196 to His-201, Lys-271 to His- 
119, Asp-286 to Glv-292, Tvr-294 to Leu-302. 


765667 I 
r 


^referred epitopes include those comprising a sequence shown in SEQ ID NO. 540 as 
esidues: Pro-i4 to Pro-21, Pro-30 to Pro-36. 
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767113 


Preferred epitopes include those comprising a sequence shown in SEQ ID NO. 541 as 
residues: Ala-62 to Pro-73. Pro-75 to Thr-83. Thr-1 10 to Phe-l 15, Glu-142 to Asp-150. 
GIn-158 to Ser-167. Glu-182 to Thr-187, Ser-190 to Asp-204. 


767204 


Preferred epitopes include those comprising a sequence shown in SEQ ID NO. 542 as 
residues: Ala-22 to Met-29, Arg-45 to Pheo6, Asp-63 to Asp.71, Gly-81 to Ala-SS, Gln- 
155 to Trp-162. 


767962 


Preferred epitopes include those comprising a sequence shown in SEQ ID NO. 544 as 
residues: Ghi-126 to Glv-132. Asn-146 to Ser-158, Phe-179 to Leu- 188. 


768040 


Preferred epitopes include those comprising a sequence shown in SEQ ID NO. 545 as 
residues: Pro-24 to Trp-32, Val-5l to Arg-62, Gly-84 to Asp-93, Asp- 108 to Asn-120, 
Glu- 1 50 to Val- 1 58, Glv- 1 69 to Glv- 1 75 . 


769956 


Preferred epitopes include those comprising a sequence shown in SEQ ID NO. 546 as 
residues: Pro-l to Are-6. 


770133 


Preferred epitopes include those comprising a sequence shown in SEQ ID NO. 547 as 
residues: Glu-1 to Ser-6. 


771964 


Prefened epitopes include those comprising a sequence shown in SEQ ID NO. 549 as 
residues: Pro-8 to Gly-I5. Thr-26 to Phe-32. Thr-102 to Ser-109, Ala-1 12 toThr-l 18, 
His-130 to Glu-152, Scr-161 to Ala-170, Ser-204 to His-209, Gly-221 to Ser-229, Ser- 
233 to Ala-240, Glu-242 to Pro-247. Leu-251 to Gln-258, Lcu.278 to Leu-285, Thr-333 
to GIu-338. 


773387 


Preferred epitopes include those comprising a sequence shown in SEQ ID NO. 551 as 
residues: Lys-36 to Lys-45, Ala-59 to Arg-67. Cys-99 to Arg-108, Ala-1 15 to Cys-125, 
Are- 143 to Are-153. 


773827 


Preferred epitopes include those comprising a sequence shown in SEQ ID NO. 552 as 
residues: Pro-l to Ala-15, Ser-72 to His-79, Gly-89 to Tyr.105, Lys-179 to Lys-184, 
Arc-246 to Asp-25 1. Glu-302 to Lys-309, Ser-329 to Phe-341. 


774108 


Preferred epitopes include those comprising a sequence shown in SEQ ID NO. 553 as 
residues: Ala-1 to Gly-2 1, Pro-28 to Leu-39, Pro-48 to Asp-62, ArR-7I to ArK-78. 


775339 


Preferred epitopes include those comprising a sequence shown in SEQ ID NO. 555 as 
residues: Asp-6 to Thr-1 3, Asp-24 to Met-30. 


775582 


Preferred epitopes include those comprising a sequence shown in SEQ ID NO, 556 as 
residues: Gly-1 to Asn-12, Ser-69 to Glu-77. 


777809 


Preferred epitopes include those comprising a sequence shown in SEQ ID NO. 558 as 
residues: Arg-15 to Gly-25. 


778927 


Preferred epitopes include those comprising a sequence shown in SEQ ID NO. 559 as 
residues: Aia-74 to Ser-82, Asn-109 to Ala- 124, Ser-147 to He- 152, Pro- 188 to Gly-194, 
Arg-290 to Pro-299, Tyr-307 to Glu-319, Tyr-341 to lle-346, Lys-423 to Ser-441. Gln- 
452 to Glu-465. 


779262 


Preferred epitopes include those comprising a sequence shown in SEQ ID NO. 560 as 
residues: Are-5 to lIe-24, Gly-35 to Trp-40, Glu-42 to Thr-48, Lys-76 to Gly-95. 


780149 


Preferred epitopes include those comprising a sequence shown in SEQ ID NO. 562 as 
residues: Gly-1 3 to Gin- 18, Pro-71 to Glu-89, lie- 134 to Asp- 139, Pro-232 to Met-240. 


780583 


Preferred epitopes include those comprising a sequence shown in SEQ ID NO. 563 as 
residues: Asn-58 to Thr-64, Ile-72 to Ser-78, Gly-l 19 to Lys-128. 


780960 


Preferred epitopes include those comprising a sequence shown in SEQ ID NO. 564 as 
residues: Ala-7 to lie- 14, Lys-27 to Asp-35, Thr-63 to Leu-73. 


781469 


Preferred epitopes include those comprising a sequence shown in SEQ ID NO. 565 as 
residues: Pro-l to Ala-12, Arg-27 to Gln-45, Are-57 to Gln-64, Lys-74 to AsD-96. 


781771 


Preferred epitopes include those comprising a sequence shown in SEQ ID NO. 567 as 
residues: Glu-38 to Leu-52, Glu-64 to Lys-72, Asn-92 to Ala-102, Ala-104 to Asp-1 19, 
Pro-l 21 to Pro- 130, Ser-165 to Ser-173. 


782033 


Preferred epitopes include those comprising a sequence shown in SEQ ID NO. 568 as 
residues: Ala-1 to Gly-19, Gln-41 toGly-46. 


782105 


^referred epitopes include those comprising a sequence shown in SEQ ID NO, 569 as 
residues: Leu- 13 to GIy-34. ArR-77 to Pro-85, Lys-129 to Are- 135. 


782122 


Preferred epitopes include those comprising a sequence shown in SEQ ID NO. 570 as 
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residues: Pro-l to Arg-6, Ala- 102 to Ala- 1 08, Pro- 1 48 to Asp- 158, Gly-164 to Aia-171. 
Pro-223 to Asn-23L Pro-272 to Ser-282. Ala-294 to Pro-310. Pro-322 to Ar^-327. 


783245 


Prefen-ed epitopes include those comprising a sequence shown in SEQ ID NO. 572 as 
residues: Leu-90 to Aru-97, Ala-107 to Pro-l 13. 


783247 


Preferred epitopes include those comprising a sequence shown in SEQ ID NO. 573 as 
residues: Scr-2 to Leu-8. 


783413 


Preferred epitopes include those comprising a sequence shown in SEQ ID NO. 574 as 
residues: Lvs-33 to Val-39. 


784407 


Preferred epitopes include those comprising a sequence shown in SEQ ID NO. 575 as 
residues: Gly-28 to Val-36. 


784548 


Preferred epitopes include those comprising a sequence shown in SEQ ID NO. 576 as 
residues: Trp-1 to Pro-9, Pro- 15 to Gln-24. Pro-52 to Thr-57. 


785677 


Preferred epitopes include those comprising a sequence shown in SEQ ID NO. 578 as 
residues: Gly-7 to Giy-14. 


786238 


Preferred epitopes include those comprising a sequence shown in SEQ ID NO. 579 as 
residues: GIv*l to Gly-8. 


786389 


Preferred epitopes include those comprising a sequence shown in SEQ ID NO. 580 as 
residues: Ser-2 to Arg-I6, GIy-34 to Glu-44, Arg-62 to Gln-69, Pro- 102 to He- 108, Asp- 
1 87 to Thr- 1 93, Leu.203 to Pro-2 13. 


786929 


Preferred epitopes include those comprising a sequence shown in SEQ ID NO. 581 as 
residues: Pro-2 to Trp-7, Tyr-36 to Tyr-43. 


786932 


Preferred epitopes include those comprising a sequence shown in SEQ ID NO. 582 as 
residues: Scr-18 to His-30, Thr-39 to Arg-5l, Leu-59 to Thr-66. Pro-131 to Lys-136, 
Pro-l 49 to Ser-157. 


787078 


Preferred epitopes include those comprising a sequence shown in SEQ ID NO. 583 as 
residues: Glu-20 to Pro-26. 


787283 


Preferred epitopes include those comprising a sequence shown in SEQ ID NO. 585 as 
residues: Glu-7 to Arg-13, Gln-26 to Arg-34. 


788988 


Preferred epitopes include those comprising a sequence shown in SEQ ID NO. 587 as 
residues: Pro-41 to Tyr-50, Thr-70 to Lys-75. 


789092 


Preferred epitopes include those comprising a sequence shown in SEQ ID NO. 588 as 
residues: Thr-27 to Ala-34, Leu-41 to Glu-48, Glu-76 to Asn-87, Asn-l 10 to Leu-llS, 
Gly-125 to Lys-133. 


789298 


Preferred epitopes include those comprising a sequence shown in SEQ ID NO. 589 as 
residues: Are-1 to Ser-14, Glu-56 to G1y-6I, Ala-92 to Gln-98. Glu-134 to Val-154. 


789718 


Preferred epitopes include those comprising a sequence shown in SEQ ID NO. 591 as 
residues: Cys-17 to Ala-24. 


790285 


Preferred epitopes include those comprising a sequence shown in SEQ ID NO. 594 as 
residues: Thr-1 1 to Leu- 18, Leu-22 to Val-31, Trp-33 to Lys-49, Ser-63 to GIu-72, Cys- 
80to Ala-9KPro-97 to His- 11 6. 


790509 


Preferred epitopes include those comprising a sequence shown in SEQ ID NO. 595 as 
residues: Ser-6 to His-20, Leu-22 to Gly-32, Lys-103 to Arg-l 11, Ser-125 to Gly-130, 
Glu-204 to His-210, Thr-213 to His-219, Pro-222 to Asp-244, Ser-250 to Glu-258, Arg- 
263 to Arfi-268. 


790775 


Preferred epitopes include those comprising a sequence shown in SEQ ID NO. 596 as 
residues: Arg-42 to Asp-48, Cvs-79 to Thr-85, Leu- 1 13 to Ser-123. 


790888 


Preferred epitopes include those comprising a sequence shown in SEQ ID NO. 597 as 
residues: Pro-14 to Asp-19, Asp-40 to Leu-45, Ser-53 to Val-58, Leu-81 toTyr-91. 


791506 


Preferred epitopes include those comprising a sequence shown in SEQ ID NO. 598 as 
residues: Arg-l to Gly-9, Asp-i9 to His-25, Gly-51 to Glu-61. 


792002 


Preferred epitopes include those comprising a sequence shown in SEQ ID NO. 601 as 
residues: Arg-l to Gly-6, Val-22 to Pro-35, Val-106 to lle-l 12, His-lIS to Gln-124, Ser- 
1 32 to Leu- 1 45, Asn- 1 64 to Asn- 1 70, Are- 1 87 to Tyr- 1 92. 


792291 


Preferred epitopes include those comprising a sequence shown in SEQ ID NO. 602 as 
residues: Pro-14 to Arg-31. 


792371 


Preferred epitopes include those comprising a sequence shown in SEQ ID NO. 603 as 
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residues: Gly-37 to Gly-52, Pro-63 to Gly-69, Ser-74 to His-8K Ser-94 to Thr-105, VaU 
109 10 Thr-1 14. Phe-165 to Ser-181. Ala-191 to Asp- 196. Asn-209 to Ser-216. 


792660 


Preferred epitopes include those comprising a sequence shown in SEQ ID NO. 604 as 
residues: Thr-1 1 to Arc- 16. Asn-78 to Asp-84. 


792782 


Preferred epitopes include those comprising a sequence shown in SEQ ID NO. 605 as 
residues: Ala-65 to Glv-8 1 . 


792890 


Preferred epitopes include those comprising a sequence shown in SEQ ID NO. 606 as 
residues: Pro-26 to His-3 1. Ara-34 to Ser-44, Pro-59 to Ser-7U Leu-77 to Gly-83. 


792931 


Preferred epitopes include those comprising a sequence shown in SEQ ID NO. 607 as 
residues: Pro-3 to His-12, 


792943 


Preferred epitopes include those comprising a sequence shown in SEQ ID NO. 608 as 
residues: Lys-3 to Tyr-9, Gly-15 to Thr-22, Leu-36 to Asp-41, Leu-67 to Lys-76, Asp-86 
to Ser-93. Tyr-174 to Asp- 184. Leu-255 to GIu-260, lle-331 to Val-337. 


793446 


Preferred epitopes include those comprising a sequence shown in SEQ ID NO. 611 as 
residues: His-l to Gly-12. 


793639 


Preferred epitopes include those comprising a sequence shown in SEQ ID NO. 612 as 
residues: Arg-6 to Arc- 1 3, Pro-47 to Val-52, Gln-57 to Are-65, Are-72 to Glu-78, Asp- 
1 1 7 to Thr- 1 24, Phe- 1 32 to His- 1 3 7. 


794213 


Preferred epitopes include those comprising a sequence shown in SEQ ID NO. 613 as 
residues: Tvr-1 to Trp-9, Thr-44 to Leu-49. 


795955 


Preferred epitopes include those comprising a sequence shown in SEQ ID NO. 615 as 
residues: Lys-60 to Lys-65, Lvs-99 to Ala- 104. 


796555 


Preferred epitopes include those comprising a sequence shown in SEQ ID NO. 61 7 as 
residues: Ser-1 to Gly-IO, Glv-90 to Gly-97, Asn-185 to Arg-197, Pro-202 to Are-211. 


796675 


Preferred epitopes include those comprising a sequence shown in SEQ ID NO. 618 as 
residues: Ser-35 to GIy-40, Ser-103 to His-109, Tyr-151 toGly-159, Pro-216 to Glu- 
224, Asn-249 to Trp.258, Pro-278 to Glu-284. 


796743 


Preferred epitopes include those comprising a sequence shown in SEQ ID NO. 619 as 
residues: Asn-l to Gly-6, Asn-100 to Glu-106, Gln-108 to Asp-1 16, Asp- 146 to Thr- 
I51,Thr-l91 toGlu-198. 


796792 


Preferred epitopes include those comprising a sequence shown in SEQ ID NO. 620 as 
residues: Asn-23 to Gly-28, Cvs-41 to Asp-47, Gln-82 to Glu-88. 


799668 


Preferred epitopes include those comprising a sequence shown in SEQ ID NO. 621 as 
residues: Gly-2 to Are- 10, ne-27 to Pro-33. 


799669 


Preferred epitopes include those comprising a sequence shown in SEQ ID NO. 622 as 
residues: Gly- 1 to Ser-1 2. 


799673 


Preferred epitopes include those comprising a sequence shown in SEQ ID NO. 623 as 
residues: Gly-1 to Ala-14. Leu-38 to Pro-46. 


799674 


Preferred epitopes include those comprising a sequence shown in SEQ ID NO. 624 as 
residues: Pro-39 to Pro-45. 


799678 


Preferred epitopes include those comprising a sequence shown in SEQ ID NO. 625 as 
residues: Lys-54 to Ser-60, Tyr-86 to His-93. 


799728 


Preferred epitopes include those comprising a sequence shown in SEQ ID NO. 626 as 
residues: Trp-7 to Gin- 19. 


799748 


Preferred epitopes include those comprising a sequence shown in SEQ ID NO. 627 as 
residues: Glu-7 to Are- 12, Lvs-62 to His-68. 


799760 


Preferred epitopes include those comprising a sequence shown in SEQ ID NO. 628 as 
residues: ile-15 to Trp-22. 


800296 


Preferred epitopes include those comprising a sequence shown in SEQ ID NO. 630 as 
residues: Asn-l 9 to Thr-39, Glu-42 to Ile-48, Arg-55 to Asp-66, He- 130 to Arg-135, 
Lys-149 to Ala-156, GIu-166 to Leu-176, Met-213 to Lys-219, Pro-233 to Pro-248, Lys- 
258 to Lvs-263. 


800327 

■ 


Preferred epitopes include those comprising a sequence shown in SEQ ID NO. 631 as 
residues: Arg-13 to Gly-19, Lys-32 to Glu-39, Lys-94 to Trp-100, Asn-102 to Asp-108, 
Ala-1 17 to Leu- 129. 


800816 


Preferred epitopes include those comprising a sequence shown in SEQ ID NO. 632 as 
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residues: Lvs-I to lle-i I. Gln-36 to Leu-46. 


800835 


Preferred epitopes include those comprising a sequence shown in SEQ ID NO. 633 as 
residues: Trp-l to Gln-l 1, Gly-37 to Gln-50- Ser.l09 to GIn-1 14, Glu-146 to Leu-155, 
Glu- 1 75 to Gly- 1 80. Thr-1 88 to Ser-200. 


805429 


Preferred epitopes include those comprising a sequence shown in SEQ ID NO. 634 as 
residues: Pro-6 to Scrol. Gin- 1 00 to Glu-l07. 


805458 


Preferred epitopes include those comprising a sequence shown in SEQ ID NO. 635 as 
residues: GUi-57 to Ser-62, Thr-102 to Ser-120. 


805478 


Preferred epitopes include those comprising a sequence shown in SEQ ID NO. 636 as 
residues: Glu-31 to Glu-37, Pro-47 to Ser-52, Asn-57 to Asn-66. 


805805 


Preferred epitopes include those comprising a sequence shown in SEQ ID NO. 637 as 
residues: Are- 1 to Cvs-16. Tyr-59 to Lvs-68. Glu-76 to Ar.e-82. 


806486 


Preferred epitopes include those comprising a sequence shown in SEQ ID NO. 638 as 
residues: Phe-1 to Val-6, Pro-i 1 to Gly- 18. 


806498 


Preferred epitopes include those comprising a sequence shown in SEQ ID NO, 639 as 
residues: Pro-6 to Ser-1 7. Arg-81 to Thr-88, Arg-198 to Val-203, Arg-285 to Arg-296, 
Gln-302 to Ser-361, Leu-399 to Ser-407. 


810870 


Preferred epitopes include those comprising a sequence shown in SEQ ID NO. 641 as 
residues: Val-12 to Ile-21. 


811730 


Preferred epitopes include those comprising a sequence shown in SEQ ID NO. 642 as 
residues: Are-33 to Ara-40. 


813262 


Preferred epitopes include those comprising a sequence shown in SEQ ID NO. 645 as 
residues: Gly-3I to Asp-5L Cys-68 to Val-81. Leu-85 to Cys-92. 


815637 


Preferred epitopes include those comprising a sequence shown in SEQ ID NO. 646 as 
residues: Art;- 13 to Asp- 1 9, Ser-SO to Gly-9L Pro-99 to Ser-l 11. 


815853 


Preferred epitopes include those comprising a sequence shown in SEQ ID NO. 647 as 
residues: Cys-25 to Ser-3I, Gln-63 to Asp-73, Arg-98 to Gly-106. Pro-120 to Arg-125, 
Leu- 136 to Asp-141 . Glv-155 to Glu- 170, Phe-179 to Gly- 186. 


815999 


Preferred epitopes include those comprising a sequence shown in SEQ ID NO. 648 as 
residues: Asp-1 to Asp- 10, Arg-19 to Glu-28, Gly-86 to Leu-93, Arg-1 13 to His- 11 8. 


823427 


Preferred epitopes include those comprising a sequence shown in SEQ ID NO. 649 as 
residues: Pro- 16 to Cvs-27, Ar^e-70 to Ar^e-76, 


823704 


Preferred epitopes include those comprising a sequence shown in SEQ ID NO. 650 as 
residues: Val-29 to Lvs-34, Arg-58 to His-63, Gln-87 to Lys-97, Arg-1 95 to Ser-200. 


824798 


Preferred epitopes include those comprising a sequence shown in SEQ ID NO. 651 as 
residues: Thr-28 to His-34. 


825018 


Preferred epitopes include those comprising a sequence shown in SEQ ID NO. 652 as 
residues: Gln-l to Asn-l 1, Leu- 19 to Thr-24, Lys-47 to Arg-55, Lys-94 to Asp-99, Ala- 
101 to Arg-107, Ala- 137 to Tyr-146, Gin- 150 to Ser-1 63, Gly- 169 to Lys-175, Thr-1 82 
to Ala-189, Glu-249 to Ser-258, Pro-266 to Tyr-275, Tyr-285 to Gly-298, Asp-302 to 
Gln-315, Tyr-318 to Thr-325, Gln-332 to Ala-359, Ser-372 to Phe-384, Leu-390 to Ala- 
399, Ala-428 to Arg-437. 


825787 


Preferred epitopes include those comprising a sequence shown in SEQ ID NO. 654 as 
residues: Pro-21 to Leu-28, Arg-40 to IIe-49, Asp-84 to Asn-93, Arg-124 to Asn-130, 
Gly-140 to Asn-145, Leu-187 to GIn-196, Pro-208 to Asp-213, Arg-244 to Asp-252, lle- 
325 to GIn-336, Glu-372 to Ala-379, Asn-435 to Leu-446. Ala-460 to Arg-467, Val-500 
to Asp-506, Lys-524 to Asn-533, Thr-592 to Lys-598, Asp-648 to Ser-656. 


826116 


Preferred epitopes include those comprising a sequence shown in SEQ ID NO. 655 as 
residues: Glu-20 to Cvs-35. 


826147 


Preferred epitopes include those comprising a sequence shown in SEQ ID NO. 656 as 
residues: Lys-18 to Leu-24. 


827586 


Preferred epitopes include those comprising a sequence shown in SEQ ID NO. 658 as 
residues: Ser-7 to Glv-14, Leu-22 to Ala-28. Thr-57 to Ser-62. 


827735 


Preferred epitopes include those comprising a sequence shown in SEQ ID NO. 660 as 
residues: Pro-2 to Ser-I2, Gln.25 to G1U-3L Val-40 to ArK-45. 


827740 


Preferred epitopes include those comprising a sequence shown in SEQ ID NO. 661 as 
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residues: Ilc-22 to Lys-28. 


827808 


Preferred epitopes include those comprising a sequence shown in SEQ ID NO. 662 as 
residues: Glu-2 to Gin- 13, Gln-20 to Giy-29, Arg-32 lo Cys-47, Pro-54 to Trp-61, Thr- 
73 to GIn-91, Gly-96 to Ser-103, 


828357 


Preferred epitopes include those comprising a sequence shown in SEQ ID NO. 664 as 
residues: Gly-1 to Gly-10. Val-25 to Glu-32. His-67 to Are-73. 


828612 


Preferred epitopes include those comprising a sequence shown in SEQ ID NO. 666 as 
residues: Asp-25 to Gln-3U Asp-36 to Tyr-41, Gln-43 to Thr-48, Lys-71 to Thr-76. 


828647 


Preferred epitopes include those comprising a sequence shown in SEQ ID NO. 667 as 
residues: Ser-2 to Ser-8, Arg-61 to Gin-74, Ser-192 to Asn-202, GIn-229 to Lys-236, 
Gly-281 to Gly-292, Glu-333 to Ala-345, Ala-352 to Gln-358, Glu-360 to Leu-366, Asp- 
443 to Ser-449, Glu-452 to Glu-459, Asp-485 to Thr-492, Ala-510 to Gln-516, Ala-545 
to Ala-552, Leu-560 to Thr-566, Gluo86 to Ala-592, Asp-601 to Gln-607, Leu-609 to 
Leu-620. 


828698 


Preferred epitopes include those comprising a sequence shown in SEQ ID NO. 668 as 
residues: Pro-28 to Ser-43, Pro-45 to AIa-50, His-58 to GIn-63. 


828962 


Preferred epitopes include those comprising a sequence shown in SEQ ID NO. 669 as 
residues: Ala-42 to Gly-49, Thr-54 to Cys-63. 


829282 


Preferred epitopes include those comprising a sequence shown in SEQ ID NO, 671 as 
residues: Ser-7 to Gin- 12, Gly-25 to Gly-3I, Gly-7l to Gly-84, Leu- 147 to Glu-164, 
Trp-l72to Leu-180. 


829368 


Preferred epitopes include those comprising a sequence shown in SEQ ID NO. 672 as 
residues: Glu-1 to Tyr-7, Pro- 13 to Glu-24, Arg-31 to lle-39, Gln-59 to Lys-65, His-67 
to Leu-74. 


829751 


Preferred epitopes include those comprising a sequence shown in SEQ ID NO. 673 as 
residues: Ala-29 to Arg-45, Ser-48 to Glu-59, Lys-73 to Trp-79, AIa-100 to Ser-109. 


829934 


Preferred epitopes include those comprising a sequence shown in SEQ ID NO, 675 as 
residues: Arg-1 to Arg-6, Ser-46 to Asp-71 , Glu-76 to Glu-90, Gin- 107 to Tyr-1 18, Ser- 
124 to Asp-I3I, Glu-163 to Asp- 170, Ala-239 to Asp-245, Asp-262 to Arg-268, G!n-276 
to Asp-283, Arg-293 to Lys-300, Ser-307 to Glu-3 13. Phe-346 to Phe-351, Phe-361 to 
Ala-373. 


829951 


Preferred epitopes include those comprising a sequence shown in SEQ ID NO. 677 as 
residues: Thr-21 to Lys-28. 


830173 


Preferred epitopes include those comprising a sequence shown in SEQ ID NO. 678 as 
residues: Gly-5l to Asn-68, Thr-75 to Lys-82, Ala-86 to Ala-97, Asn-99 to Arg-106, 
Leu-121 to Phe-126, Ala- 1 55 to Ser-163, Asp- 175 to Asp- 1 80, Ala- 184 to Phe-196, Leu- 
204 to Asn-214, Asp-219 to Gln-232, Leu-269 to Arg-274, Pro-392 to Pro-400, Thr-430 
to Asn-437, Tyr-472 to Gln-477, Leu-483 to Gln-499, Asn-516 to Gln-524, Ser-533 to 
Gln-546, Lys-562 to Glu-576, Leu-589 to Ala-594, Asp-624 to Ala-633, Ue-741 to Asp- 
746, Val-817 to Lys-839, Tyr-872 to Lys-878, Thr-929 to Asp-940. 


830365 


Preferred epitopes include those comprising a sequence shown in SEQ ID NO. 680 as 
residues: Trp-36 to Glu-41, Asp-71 to ArK-76, Asn-80 to GIy-87, Arg-103 to Pro-1 15. 


830456 


Preferred epitopes include those comprising a sequence shown in SEQ ID NO. 68 1 as 
residues: Leu-48 to Cys-54. 


830549 


Preferred epitopes include those comprising a sequence shown in SEQ ID NO. 682 as 
residues: Ser-l to Pro-24, Pro-40 to Thr-50, Glu-62 to Gly-83, Arg-103 to Leu-108, Ser- 
141 to Lys-146, Lys-184 to Ser-l 90. 


830602 


Preferred epitopes include those comprising a sequence shown in SEQ ID NO. 683 as 
residues; Arg-53 to Thr-63, He- 1 00 to Lys-108, 


830610 


Preferred epitopes include those comprising a sequence shown in SEQ ID NO. 684 as 
residues: Pro-27 to Cys-32, Ala-61 to Gly-70, Pro-76 to Gly-85, Met-1 15 to Gly-120, 
Glu-162 to Lys-171, Pro-222 to Tvr-228, Glu-242 to Thr-248, Lvs-261 to Gly-269. 


830644 


Preferred epitopes include those comprising a sequence shown in SEQ ID NO- 685 as 
residues: He- 1 to Ser-l 0. 


830707 


Preferred epitopes include those comprising a sequence shown in SEQ ID NO. 686 as 
residues: Asn-34 to Leu-53, Gln-61 to Leu-67. 
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830709 


Preferred epitopes include those comprising a sequence shown in SEQ ID NO. 687 as 
residues: Are- 13 to Gin- 1 8. Pro-22 to Ala-40. Ala-66 to Asp-84. Glu-94 to Are- 101. 


830733 


Preferred epitopes include those comprising a sequence shown in SEQ ID NO. 688 as 
residues: Glu-i to Asp-8. 


830855 


Preferred epitopes include those comprising a sequence shown in SEQ ID NO. 690 as 
residues: Ser-1 to His-6. 


830949 


Preferred epitopes include those comprising a sequence shown in SEQ ID NO. 691 as 
residues: Arg-5 to Arg-12, Gly-25 to Trp-30, Thr-77 to Trp-96, Thr-iOl to Glu-106, 
Gly-109 to Arg-127. 


830965 


Preferred epitopes include those comprising a sequence shown in SEQ ID NO. 692 as 
residues: Leu-24 to Are-56. Pro-83 to Are-90, lie- 1 10 to He- 1 15, Lys-123 to Vai-136. 


830973 


Preferred epitopes include those comprising a sequence shown in SEQ ID NO. 693 as 
residues: Ser-1 lo Asn-7, Tvr-13 to Asp-23. 


830989 


Preferred epitopes include those comprising a sequence shown in SEQ ID NO. 695 as 
residues: Cys-2 to Ser-1 6, GIu-55 to Lys-6i, Pro-83 to Leu-88, Ser-135 to Pro- 148, Val- 
152 to Arg-163, Pro-223 to Thr-230, Ala-242 to VaI-253, Arg-258 to GIu-274, Gly-290 
to Asp-300, Lys-337 to Asn-345, Asp-373 to Ala-398, GIy-401 to Lys-406, GIn-410 to 
Ala-430- Pro-433 to Gln-460. 


831134 


Preferred epitopes include those comprising a sequence shown in SEQ ID NO. 696 as 
residues: Ala- 19 to His-24. 


831200 


Preferred epitopes include those comprising a sequence shown in SEQ ID NO. 697 as 
residues: Trp-1 to Gly-6. 


831531 


Preferred epitopes include those comprising a sequence shown in SEQ ID NO. 699 as 
residues: Ser-94 to Asn-1 16, Glu-139 to Asp- 1 55, Tyr-190 to Leu- 195, Ile-230 to lle- 
235.Ser-309 ioGlu-317. 


831665 


Preferred epitopes include those comprising a sequence shown in SEQ ID NO. 700 as 
residues: Leu-4 to Trp-12. 


831724 


Preferred epitopes include those comprising a sequence shown in SEQ ID NO. 701 as 
residues: Pro-26 to Lys-32. 


831884 


Preferred epitopes include those comprising a sequence shown in SEQ ID NO. 702 as 
residues: Pro-46 to Ala-52, Thr-68 to Trp-86, Arft-91 to Ars-96, Lys-127 to Asp-141. 


831897 


Preferred epitopes include those comprising a sequence shown in SEQ ID NO. 703 as 
residues: Pro-10 to Ser-20, Val-73 to Ser-78, Asp-123 to Glu-134, Leu-138 to Val-149, 
Ala- 1 8 1 to Ala- 1 87, Thr- 1 89 to Val- 1 96, Arg-2 1 3 to Gin-224. 


831922 


Preferred epitopes include those comprising a sequence shown in SEQ ID NO. 704 as 
residues: Leu-32 to Asp-37. Ile-43 to Asn-49. 


832266 


Preferred epitopes include those comprising a sequence shown in SEQ ID NO. 707 as 
residues: Ala-73 to Arg-79. 


832309 


Preferred epitopes include those comprising a sequence shown in SEQ ID NO. 708 as 
residues: Val-10 to Gly-15, Ser-98 to Thr-105. 


832342 


Preferred epitopes include those comprising a sequence shown in SEQ ID NO. 709 as 
residues: Pro-9 to Trp-1 6, Thr-66 to Ser-72. 


832351 


Preferred epitopes include those comprising a sequence shown in SEQ ID NO. 710 as 
residues: Asp- 1 6 to Val-2 1 , Leu-54 to Asp-7 1 . 


832352 


Preferred epitopes include those comprising a sequence shown in SEQ ID NO. 71 1 as 
residues: Asp- 16 to Val-2 1, Leu-33 to Asp-50. 


832434 


Preferred epitopes include those comprising a sequence shown in SEQ ID NO. 712 as 
residues: Tyr-15 to Glu-23. Ser-46 to Arg-5 1 . GIn-56 to Trp-6I, Pro-79 to Lvs-86. 


832490 


Preferred epitopes include those comprising a sequence shown in SEQ ID NO. 713 as 
residues: Arg-16 to Gly-23. Ala-37 to Asp-46, Asp-91 to Asp-97. 


832573 


Preferred epitopes include those comprising a sequence shown in SEQ ID NO. 714 as 
■esidues: Ala-9 to Gin- 16, Glu-2 1 to Are-27, GIy-66 to Pro-72. 


833394 

I 
1 


Preferred epitopes include those comprising a sequence shown in SEQ ID NO. 716 as 
-esidues: Glu-1 to GIy-6, Asp-12 to Gly-22, lle-28 to Gln-33, Cys-86 to Gly-92. Gly-96 
olle-105. 


835355 iPreferrcd epitopes include those comprising a sequence shown in SEQ ID NO. 717 as 
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residues: Glu-8 to Scr- 1 5, Gly-42 to Leu-49, Pro-73 to Gly-79, Tyr-82 to Arg-87, Ser- 
1 09 to Gly- 1 1 S. Glu- 1 22 to lie- 1 28, Asp- 1 32 to Gly- 1 37, Asp- 1 46 to Arg- 1 5 1 , Pro- 1 53 
to Lys- 1 58, Gly- 1 9 1 to His- 197. Tyr-2 1 0 to Ser-2 1 8, Lys-234 to Gly-239, Aia-246 to 
Ala-252, His-257 to Pro-268. Ser-274 to Gly-280, Pro-316 to Tyr-323, lle-358 to Leu- 
363, Gln-375 to Tyr-jS 1. Gln-390 to Tvr-397, Gln-418 to Cys-430. 


835497 


Preferred epitopes include those comprising a sequence shown in SEQ ID NO. 718 as 
residues: Glu- 141 to Pro- 15 L Asp- 179 to Glu- 184. Gly-214 to Ser-2 19, Thr-226 to Tyr- 
231, Thr-239 to Glv-248. Pro-281 to GIv-297, Pro-326 to Arc-336. Gln-408 to Asp-416. 


835978 


Preferred epitopes include those comprising a sequence shown in SEQ ID NO. 720 as 
residues: Trp-25 to Val-3 1. 


836274 


Preferred epitopes include those comprising a sequence shown in SEQ ID NO. 722 as 
residues: Ser-1 toGlu-9. 


836731 


Preferred epitopes include those comprising a sequence shown in SEQ ID NO. 723 as 
residues: Lys-15 to Glu-22, GIy-25 to Ala-^34, Glu-75 to Gly-81. GIn-91 to Val-100, Pro- 
146 to Glu- 155. Gln-161 to Phe-167, Asn-170 to Glv-178. 


838014 


Preferred epitopes include those comprising a sequence shown in SEQ ID NO. 724 as 
residues: Arg-1 to Pro- 10, Asp- 170 to Pro- 176, Arg-203 to Tyr-2 12, Gly-228 to Lys- 
235. 


838874 


Preferred epitopes include those comprising a sequence shown in SEQ ID NO. 725 as 
residues: Gln-30 to Gln-45. 


839120 


Preferred epitopes include those comprising a sequence shown in SEQ ID NO. 726 as 
residues: Thr-22 to Ara-27, Ar2-69 to Gly-75, Leu-77 to Pro-85. 


839611 


Preferred epitopes include those comprising a sequence shown in SEQ ID NO. 727 as 
residues: Asp- 1 2 to Thr- 1 7. 


840138 


Preferred epitopes include those comprising a sequence shown in SEQ ID NO. 728 as 
residues: Ser-l to Thr- 10. 


840616 


Preferred epitopes include those comprising a sequence shown in SEQ ID NO. 729 as 
residues: Lys-93 to Gly-99, Glu- 144 to Leu- 160, Ser-265 to Asp-270, Thr-382 to Gln- 
396, Vai-5 1 2 to Val-5 1 7. Glu-5 1 9 to Asp-535. 


840780 


Preferred epitopes include those comprising a sequence shown in SEQ ID NO. 730 as 
residues: Leu-8 to Gly- 14, Pro-151 toGlu-157. 


840857 


Preferred epitopes include those comprising a sequence shown in SEQ ID NO. 73 1 as 
residues: Gln-7 to Glu-22, Ala-27 to Arg-46, Ser-138 to Lys-147, Lys-158 to Pro-163, 
Asn-171 to Glu- 187. Glu-202 to Val-208, Glu-234 to GIy-240, Ser-253 to Lys-260, Gln- 
272 to Pro-279, Arg-292 to Glu-307, Arg-310 to Arg-317, Asp-342 to GIy-35 L Pro-367 
to Gly-375, Pro-378 to Arg-388, Leu-425 to Ala-447, Arg-536 to Asp-544, Lys-551 to 
Lys-561, Val-599 to Asp-604, Ser-622 to Ala-630, Pro-653 to Phe-659, Thr-666 to lle- 
673, Pro-699 to Phe-705. Asn-709 to Gly-719, Ala-725 to Phe-737. 


840862 


Preferred epitopes include those comprising a sequence shown in SEQ ID NO. 732 as 
residues: Arg-2 to Pro- 12, Lys-32 to Asn-37, His-75 to Asn-82. 


840864 


Preferred epitopes include those comprising a sequence shown in SEQ ID NO. 733 as 
residues: Pro-17 to Arg-30, Cys-34 to Gly^O, Met- 74 to GIu-8I, Pro-106 to Asp-1 11, 
Val-136 to Cys-147, Asn-192 to Asp-198. 


840938 


Preferred epitopes include those comprising a sequence shown in SEQ ID NO. 735 as 
residues: Ser- 140 to Thr-148, Thr-194 to Lys-202. 


841884 


Preferred epitopes include those comprising a sequence shown in SEQ ID NO. 736 as 
residues: Thr-34 to Glu-47. 


842241 


Preferred epitopes include those comprising a sequence shown in SEQ ID NO. 737 as 
residues: Thr-92 to Lys-101, Glu-134 to Thr-142, Glu-149 to Lys-155, Trp-179 to Ser- 
187, Thr-205 to Arg-21 1, Ser-2 18 to Tyr-225, Asp-283 to Gln-290, Glu-292 to Ile-302, 
Asn-304 to Met-315. 


843712 


Preferred epitopes include those comprising a sequence shown in SEQ ID NO. 738 as 
residues: Ar^-lO to Asn-16, Ala-59 to Pro-67. 


844040 


Preferred epitopes include those comprising a sequence shown in SEQ ID NO. 739 as 
residues: Phc-59 to GIu-68, Lvs-105 to GIv-1 1 1. 


844617 


Preferred epitopes include those comprising a sequence shown in SEQ ID NO. 742 as 
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residues: Artz-1 to Lvs-7. 


846187 


Preferred epitopes include those comprising a sequence shown in SEQ ID NO. 745 as 
residues: Gly-8 to Gly-14. Gly-41 to Glu-48. Gluo4 to Lys-74, Glu-87 to Arg-98, Thr- 
158 to Asn-166, Gly-247 to Ser-254, Gly-257 to Arg-277, Ala-437 to Ser-444'i Lys-505 
to Arg-510, Phe-5i9 toTyr-525, Lys-531 to Pro-538, Gly-562 to Leu-571, Phe-606 to 
VaI-613, Val-692 to Ala-697, Ser-705 to Leu-715, Leu-742 to Cys-747. 


HANGA53R 


Preferred epitopes include those comprising a sequence shown in SEQ ID NO. 749 as 
residues: Arg-4 to Ser-9. 


HAHCP93R 


Preferred epitopes include those comprising a sequence shown in SEQ ID NO. 752 as 
residues: Ser-l to Ser-12. Thr-23 to Arg-28. 


HBGAA76R 


Preferred epitopes include those comprising a sequence shown in SEQ ID NO. 753 as 
residues: Ser-4 to Ser- 1 1 . Pro-27 to Asn-37. 


HTXP129R 


Preferred epitopes include those comprising a sequence shown in SEQ ID NO. 756 as 
residues: Thr-i7 to Leu-24. Thr-57 to Tyr-67, Leu-92 to Phe-102, Asn-128 to Gln-134. 


HBGAA54R 


Preferred epitopes include those comprising a sequence shown in SEQ ID NO. 760 as 
residues: Arg-62 to Lcu-70, lle-74 to Arg-79. 


HDPJR77R 


Preferred epitopes include those comprising a sequence shown in SEQ ID NO. 763 as 
residues: Glu-7 to Lys-22. Thr-33 to Glu-39, Lys-69 to Glu-76, Asp-84 to Tyr-90. 


HTT1041R 


Preferred epitopes include those comprising a sequence shown in SEQ ID NO. 764 as 
residues: Val-17 to Ser-22. Arg-4l to Glu-46, Lys-50 to Pro-75, Ser-92 to Pro-100. 


HDPUL86R 


Preferred epitopes include those comprising a sequence shown in SEQ ID NO. 767 as 
residues: Lys-7 to Gly-13. 


HTXNT16R 


Preferred epitopes include those comprising a sequence shown in SEQ ID NO. 768 as 
residues: Leu-67 to Asn-72, Thr-102 to Phe-1 11, Gly-127 to Gin- 1 35. 


HLXNA54R 


Preferred epitopes include those comprising a sequence shown in SEQ ID NO. 770 as 
residues: Gln-1 to Glu-6, Pro-23 to Trp-3L Are-46 to Trp-5I. 


H2LAX93R 


Preferred epitopes include those comprising a sequence shown in SEQ ID NO. 772 as 
residues: Glu-3 to Gin- 10. 


HWAFWIOR 


Preferred epitopes include those comprising a sequence shown in SEQ ID NO. 773 as 
residues: GIu-13 to Asp-22, His-34 to Trp-40, Arg-69 to Lys-75. 


HBGDD17R 


Preferred epitopes include those comprising a sequence shown in SEQ ID NO. 775 as 
residues: ArR-23 to Thr-28, Pro-40 to Glu-5I, AIa-62 to His-68. 


H2CBB43R 


Preferred epitopes include those comprising a sequence shown in SEQ ID NO. 778 as 
residues: Asp-90 to Asp-95, Arg-106 to Thr-1 17. 


H2CBQ77R 


Preferred epitopes include those comprising a sequence shown in SEQ ID NO. 779 as 
residues: Asp-1 1 to Glv-16, Gin- 1 9 to Tyr-24. Pro-34 to Gly-46. 


HOEMK06R 


Preferred epitopes include those comprising a sequence shown in SEQ ID NO. 78 1 as 
residues: Pro-1 to Gin- 1 4. 


HCHAG30R 


Preferred epitopes include those comprising a sequence shown in SEQ ID NO. 783 as 
residues: Gly-1 to Trp-7. 


HAEA126R 


Preferred epitopes include those comprising a sequence shown in SEQ ID NO. 788 as 
residues: Lys-32 to Val-40, Arg-43 to Pro-51. 


H2CBN76R 


Preferred epitopes include those comprising a sequence shown in SEQ ID NO. 791 as 
residues: Ala- 1 7 to Leu-22, Thr-72 to Lys-77. 


HAGFX49R 


Preferred epitopes include those comprising a sequence shown in SEQ ID NO. 792 as 
residues : A la- 1 0 to Leu- 1 5 , His-64 to Cys-7 1 . 


HTXiCR32R 


Preferred epitopes include those comprising a sequence shown in SEQ ID NO. 794 as 
residues: Ser-2 to Gly-12, Glu-57 to Val-65. 


H6EAF46R 


Preferred epitopes include those comprising a sequence shown in SEQ ID NO. 796 as 
residues: Arg-ll to Ser-21. 


H2LAK40R 


Preferred epitopes include those comprising a sequence shown in SEQ ID NO. 798 as 
residues: Glu-1 1 to Lys-20, Pro-22 to Arg-28. 


H2LAY71R 


Preferred epitopes include those comprising a sequence shown in SEQ ID NO. 799 as 
residues: Arg-26 to Leu-36, Gln-82 to Asp-lOl, Arg-103 to Arg-108, Arg-1 13 to Arg- 
131. 


HASAW80R 


Preferred epitopes include those comprising a sequence shown in SEQ ID NO. 803 as 
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residues: Gly-1 to Are-6. Ala- 19 to Pro-27, Gly-34 to Phe-40. 


HCHAF25R 


Preferred epitopes include those comprising a sequence shown in SEQ ID NO. 804 as 
residues: Ser-30 to Thr-40. Leu-78 to Val-85, Asp-92 to Ala-97. 


HLTHH84R 


Preferred epitopes include those comprising a sequence shown in SEQ ID NO. 805 as 
residues: Glu-2 to Ala-8. 


HADDC09R 


Preferred epitopes include those comprising a sequence shown in SEQ ID NO. 808 as 
residues: Leu-3 to Gly-9, Thr-20 to Glv-29. 


HAQAIIOR 


Preferred epitopes include those comprising a sequence shown in SEQ ID NO. 81 1 as 
residues: GIy-1 to Lys-21. 


HBGBT78R 


Preferred epitopes include those comprising a sequence shown in SEQ ID NO. 814 as 
residues: Asn-1 to Lvs-22. 


HBGCB06R 


Preferred epitopes include those comprising a sequence shown in SEQ ID NO. 815 as 
residues: Phe-1 to Phe-15. 


HCHMVV05R 


Preferred epitopes include those comprising a sequence shown in SEQ ID NO. 823 as 
residues: Pro-6 to Ser-1 1. 


HODFW25R 


Preferred epitopes include those comprising a sequence shown in SEQ ID NO. 829 as 
residues: Ser-I to Thr-8. Ghi-17 to Ala-32, Arti-39 to Trp-47. 


H0EMQ9IR 


Preferred epitopes include those comprising a sequence shown in SEQ ID NO. 830 as 
residues: Aru-8 to Ser-H- 


HOGBG56R 


Preferred epitopes include those comprising a sequence shown in SEQ ID NO. 831 as 
residues: Lvs-20 to Arii-25. 
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The present invention encompasses polypeptides comprising, or alternatively 
consisting of, an epitope of the polypeptide sequence shown in SEQ ID NO:Y, or an epitope 
of the polypeptide sequence encoded by the cDNA in the related cDNA clone contained in a 
deposited library or encoded by a polynucleotide that hybridizes lo the complement of an 
5 epitope encoding sequence of SEQ ID NO:X, or an epitope encoding sequence contained in 
the deposited cDNA clone under stringent hybridization conditions, or alternatively, under 
lower stringency hybridization conditions, as defined supra. The present invention further 
encompasses polynucleotide sequences encoding an epitope of a polypeptide sequence of the 
invention (such as, for example, the sequence disclosed in SEQ ID NO:X), polynucleotide 

10 sequences of the complementary strand of a polynucleotide sequence encoding an epitope of 
the invention, and polynucleotide sequences which hybridize to this complementary strand 
under stringent hybridization conditions or alternatively, under lower stringency 
hybridization conditions, as defined supra. 

The term "epitopes," as used herein, refers to portions of a polypeptide having 

15 antigenic or immunogenic activity in an animal, preferably a mammal, and most preferably 
in a human. In a preferred embodiment, the present invention encompasses a polypeptide 
comprising an epitope, as well as the polynucleotide encoding this polypeptide.. An 
"immunogenic epitope," as used herein, is defined as a portion of a protein that elicits an 
antibody response in an animal, as determined by any method known in the art, for example, 

20 by the methods for generating antibodies described infra. (See, for example, Geysen et al., 
Proc. Natl. Acad. Sci. USA 81:3998- 4002 (1983)). The term "antigenic epitope,'' as used 
herein, is defined ais a portion of a protein to which an antibody can immunospecifically bind 
its antigen as determined by any method well known in the art, for example, by the 
immunoassays described herein. Immunospecific binding excludes non-specific binding but 

25 does not necessarily exclude cross- reactivity with other antigens. Antigenic epitopes need 
not necessarily be immunogenic. 

Fragments which function as epitopes may be produced by any conventional means. 
(See, e.g., Houghten, R. A., Proc. Natl. Acad. Sci. USA 82:5131-5135 (1985) further 
described in U.S. Patent No. 4,631,211.) 

30 In the present invention, antigenic epitopes preferably contain a sequence of at least 4, 

at least 5, at least 6, at least 7, more preferably at least 8, at least 9, at least 10, at least II, at 
least 12, at least 13, at least 14, at least 15, at least 20, at least 25, at least 30, at least 40, at 
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least 50, and, most preferably, between about 15 to about 30 amino acids. Preferred 
polypeptides comprising immunogenic or antigenic epitopes are at least 10, 15, 20, 25, 30, 
35, 40, 45, 50, 55, 60, 65, 70, 75, 80, 85, 90, 95, or 100 amino acid residues in length. 
Additional non-exclusive preferred antigenic epitopes include the antigenic epitopes 
5 disclosed herein, as well as portions thereof. Antigenic epitopes are useful, for example, to 
raise antibodies, including monoclonal antibodies, that specifically bind the epitope. 
Preferred antigenic epitopes include the antigenic epitopes disclosed herein, as well as any 
combination of two, three, four, five or more of these antigenic epitopes. Antigenic epitopes 
can be used as the target molecules in immunoassays. (See, for instance, Wilson et al., Cell 

10 37:767-778 (1984); Sutcliffe et al.. Science 219:660-666 (1983)). 

Similarly, immunogenic epitopes can be used, for example, to induce antibodies 
according to methods well known in the art. (See, for instance, Sutcliffe et al., supra; Wilson 
et al., supra; Chow et al., Proc. Natl. Acad. Sci. USA 82:910-914; and Bittle et al., J. Gen. 
Virol. 66:2347-2354 (1985). Preferred immunogenic epitopes include the immunogenic 

15 epitopes disclosed herein, as well as any combination of two, three, four, five or more of 
these immunogenic epitopes. The polypeptides comprising one or more immunogenic 
epitopes may be presented for eliciting an antibody response together with a carrier protein, 
such as an albumin, to an animal system (such as rabbit or mouse), or, if the polypeptide is of 
sufficient length (at least about 25 amino acids), the polypeptide may be presented without a 

20 carrier. However, immunogenic epitopes comprising as few as 8 to 10 amino acids have 
been shown to be sufficient to raise antibodies capable of binding to, at the very least, linear 
epitopes in a denatured polypeptide (e.g., in Western blotting). 

Epitope-bearing polypeptides of the present invention may be used to induce 
antibodies according to methods well known in the art including, but not limited to, in vivo 

25 immunization, in vitro immunization, and phage display methods. See, e.g., Sutcliffe et al., 
supra; Wilson et al, supra, and Bittle et al., J. Gen. Virol., 66:2347-2354 (1985). If in vivo 
immunization is used, animals may be immunized with free peptide; however, anti-peptide 
antibody titer may be boosted by coupling the peptide to a macromolecular carrier, such as 
keyhole limpet hemacyanin (KLH) or tetanus toxoid. For instance, peptides containing 

30 cysteine residues may be coupled to a carrier using a linker such as maleimidobenzoyl- N- 
hydroxysuccinimide ester (MBS), while other peptides may be coupled to carriers using a 
more general linking agent such as glutaraldehyde. Animals such as rabbits, rats and mice 
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are immunized with either free or carrier- coupled peptides, for instance, by intraperitoneal 
and/or intradermal injection of emulsions containing about 100 fig of peptide or carrier 
protein and Freund's adjuvant or any other adjuvant known for stimulating an immune 
response. Several booster injections may be needed, for instance, at intervals of about two 
5 weeks, to provide a useful titer of anti-peptide antibody which can be detected, for example, 
by ELISA assay using free peptide adsorbed to a solid surface. The titer of anti-peptide 
antibodies in serum from an immunized animal may be increased by selection of anti-peptide 
antibodies, for instance, by adsorption to the peptide on a solid support and elution of the 
selected antibodies according to methods well known in the art. 

10 As one of skill in the art will appreciate, and as discussed above, the polypeptides of 

the present invention , and immunogenic and/or antigenic epitope fragments thereof can be 
fused to other polypeptide sequences. For example, the polypeptides of the present 
invention may be fused with the constant domain of immunoglobulins (IgA, IgE, IgG, IgM), 
or portions thereof (CHI, CH2, CH3, or any combination thereof and portions thereof) 

15 resulting in chimeric polypeptides. Such fusion proteins may facilitate purification and may 
increase half-life in vivo. This has been shown for chimeric proteins consisting of the first 
two domains of the human CD4-polypeptide and various domains of the constant regions of 
the heavy or light chains of mammalian immunoglobulins. See, e.g., EP 394,827; 
Traunecker et al., Nature, 331:84-86 (1988). Enhanced delivery of an antigen across the 

20 epithelial barrier to the immune system has been demonstrated for antigens (e.g., insulin) 
conjugated to an FcRn binding partner such as IgG or Fc fragments (see, e.g., PCT 
Publications WO 96/22024 and WO 99/04813). IgG Fusion proteins that have a disulfide- 
linked dimeric structure due to the IgG portion desulfide bonds have also been found to be 
more efficient in binding and neutralizing other molecules than monomeric polypeptides or 

25 fragments thereof alone. See, e.g., Fountoulakis et al., J. Biochem., 270:3958-3964 (1995). 

Similarly, EP-A-O 464 533 (Canadian counterpart 2045869) discloses fusion proteins 
comprising various portions of constant region of immunoglobulin molecules together with 
another human protein or part thereof In many cases, the Fc part in a fusion protein is 
beneficial in therapy and diagnosis, and thus can result in, for example, improved 

30 pharmacokinetic properties. (EP-A 0232 262.) Alternatively, deleting the Fc part after the 
fijsion protein has been expressed, detected, and purified, may be desired. For example, the 
Fc portion may hinder therapy and diagnosis if the fusion protein is used as an antigen for 
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immunizations. In drug discovery, for example, human proteins, such as hlL-5, have been 
fused with Fc portions for the purpose of high-throughput screening assays to identify 
antagonists of hlL-5. (See, D. Bennett et al., J. Molecular Recognition 8:52-58 (1995); K. 
Johanson et al., J. Biol. Chem. 270:9459-9471 (1995).) 
5 Moreover, the polypeptides of the present invention can be fused to marker 

sequences, such as a peptide which facilitates purification of the fused polypeptide. In 
preferred embodiments, the marker amino acid sequence is a hexa-histidine peptide, such as 
the tag provided in a pQE vector (QIAGEN, Inc., 9259 Eton Avenue, Chatsworth, CA, 
91311), among others, many of which are commercially available. As described in Gentz et 

10 al., Proc. Natl. Acad. Sci. USA 86:821-824 (1989), for instance, hexa-histidine provides for 
convenient purification of the fusion protein. Another peptide tag useful for purification, the 
"HA" tag, corresponds to an epitope derived from the influenza hemagglutinin protein. 
(Wilson etal., Cell 37:767 (1984).) 

Thus, any of these above fiisions can be engineered using the polynucleotides or the 

15 polypeptides of the present invention. 

Nucleic acids encoding the above epitopes can also be recombined with a gene of 
interest as an epitope tag (e.g., the hemagglutinin ("HA") tag or flag tag) to aid in detection 
and purification of the expressed polypeptide. For example, a system described by 
Janknecht et al. allows for the ready purification of non-denatured fusion proteins expressed 

20 in human cell lines (Janknecht et al., Proc. Natl. Acad. Sci. USA 88:8972- 897 (1991)). In 
this system, the gene of interest is subcloned into a vaccinia recombination plasmid such that 
the open reading frame of the gene is translationally fused to an amino-terminal tag 
consisting of six histidine residues. The tag serves as a matrix binding domain for the fusion 
protein. Extracts from cells infected with the recombinant vaccinia virus are loaded onto 

25 Ni2+ nitriloacetic acid-agarose column and histidine-tagged proteins can be selectively 
eluted with imidazole-containing buffers. 

Additional fusion proteins of the invention may be generated through the techniques 
of gene-shuffling, motif-shuffling, exon-shuffling, and/or codon-shuffling (collectively 
referred to as "DNA shuffling"). DNA shuffling may be employed to modulate the activities 

30 of polypeptides of the invention, such methods can be used to generate polypeptides with 
altered activity, as well as agonists and antagonists of the polypeptides. See, generally, U.S. 
Patent Nos. 5,605,793; 5,81 1,238; 5,830,721; 5,834,252; and 5,837,458, and Patten et al., 
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Curr. Opinion Biotechnol. 8:724-33 (1997); Harayama, Trends Biotechnol. 16(2):76-82 
(1998); Hansson, et al., J. Mol. Biol. 287:265-76 (1999); and Lorenzo and Blasco, 
Biotechniques 24(2):308- 13 (1998) (each of these patents and publications are hereby 
incorporated by reference in its entirety). In one embodiment, alteration of polynucleotides 
5 corresponding to SEQ ID NO:X and the polypeptides encoded by these polynucleotides may 
be achieved by DNA shuffling. DNA shuffling involves the assembly of two or more DNA 
segments by homologous or site-specific recombination to generate variation in the 
polynucleotide sequence. In another embodiment, polynucleotides of the invention, or the 
encoded polypeptides, may be altered by being subjected to random mutagenesis by error- 
10 prone PGR, random nucleotide insertion or other methods prior to recombination. In another 
embodiment, one or more components, motifs, sections, parts, domains, fragments, etc., of a 
polynucleotide encoding a polypeptide of the invention may be recombined y/ith one or more 
components, motifs, sections, parts, domains, fragments, etc. of one or more heterologous 
molecules. 

15 As discussed herein, any polypeptide of the present invention can be used to generate 

fusion proteins. For example, the polypeptide of the present invention, when fused to a 
second protein, can be used as an antigenic tag. Antibodies raised against the polypeptide of 
the present invention can be used to indirectly detect the second protein by binding to the 
polypeptide. Moreover, because secreted proteins target cellular locations based on 

20 trafficking signals, polypeptides of the present invention which are shown to be secreted can 
be used as targeting molecules once fused to other proteins. 

Examples of domains that can be fused to polypeptides of the present invention 
include not only heterologous signal sequences, but also other heterologous functional 
regions. The fusion does not necessarily need to be direct, but may occur through linker 

25 sequences. 

In certain preferred embodiments, proteins of the invention comprise fusion proteins 
wherein the polypeptides are N and/or C- terminal deletion mutants. In preferred 
embodiments, the application is directed to nucleic acid molecules at least 80%, 85%, 90%, 
95%, 96%, 97%, 98% or 99% identical to the nucleic acid sequences encoding polypeptides 
30 having the amino acid sequence of the specific N- and C-terminal deletions mutants. 
Polynucleotides encoding these polypeptides are also encompassed by the invention. 
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Moreover, fusion proteins may also be engineered to improve characteristics of the 
polypeptide of the present invention. For instance, a region of additional amino acids, 
particularly charged amino acids, may be added to the N-terminus of the polypeptide to 
improve stability and persistence during purification from the host cell or subsequent 
5 handling and storage. Also, peptide moieties may be added to the polypeptide to facilitate 
purification. Such regions may be removed prior to final preparation of the polypeptide. The 
addition of peptide moieties to facilitate handling of polypeptides are familiar and routine 
techniques in the art. 

10 Vectors. Host Cells, and Protein Production 

The present invention also relates to vectors containing the polynucleotide of the 
present invention, host cells, and the production of polypeptides by recombinant techniques. 
The vector may be, for example, a phage, plasmid, viral, or retroviral vector. Retroviral 
vectors may be replication competent or replication defective. In the latter case, viral 

15 propagation generally will occur only in complementing host cells. 

The polynucleotides of the invention may be joined to a vector containing a selectable 
marker for propagation in a host. Generally, a plasmid vector is introduced in a precipitate, 
such as a calcium phosphate precipitate, or in a complex with a charged lipid. If the vector is 
a virus, it may be packaged in vitro using an appropriate packaging cell line and then 

20 transduced into host cells. 

The polynucleotide insert should be operatively linked to an appropriate promoter, 
such as the phage lambda PL promoter, the E. coli lac, trp, phoA and tac promoters, the SV40 
early and late promoters and promoters of retroviral LTRs, to name a few. Other suitable 
promoters will be known to the skilled artisan. The expression constructs will further contain 

25 sites for transcription initiation, termination, and, in the transcribed region, a ribosome 
binding site for translation. The coding portion of the transcripts expressed by the constructs 
will preferably include a translation initiating codon at the beginning and a termination codon 
(UAA, UGA or UAG) appropriately positioned at the end of the polypeptide to be translated. 
As indicated, the expression vectors will preferably include at least one selectable 

30 marker. Such markers include dihydrofolate reductase, 0418 or neomycin resistance for 
eukaryotic cell culture and tetracycline, kanamycin or ampicillin resistance genes for 
culturing in E. coli and other bacteria. Representative examples of appropriate hosts include, 
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but are not limited to, bacterial cells, such as E. coli, Streptomyces and Salmonella 
typhimurium cells; fungal cells, such as yeast cells (e.g., Saccharomyces cerevisiae or Pichia 
pastoris (ATCC Accession No. 201 178)); insect cells such as Drosophila S2 and Spodoptera 
Sf9 cells; animal cells such as CHO, COS, 293, and Bowes melanoma cells; and plant cells. 
5 Appropriate culture mediums and conditions for the above-described host cells are known in 
the art. 

Among vectors preferred for use in bacteria include pQE70, pQE60 and pQE-9, 
available from QIAGEN, Inc.; pBluescript vectors, Phagescript vectors, pNH8A, pNH16a, 
pNHISA, pNH46A, available from Stratagene Cloning Systems, Inc.; and ptrc99a, pKK223- 

10 3, pKK233-3, pDR540, pRIT5 available from Pharmacia Biotech, Inc. Among preferred 
eukaryotic vectors are pWLNEO, pSV2CAT, pOG44, pXTl and pSG available from 
Stratagene; and pSVK3, pBPV, pMSG and pSVL available from Pharmacia. Preferred 
expression vectors for use in yeast systems include, but are not limited to pYES2, pYDl, 
pTEFl/Zeo, pYES2/GS, pPICZ, pGAPZ, pGAPZalph, pPIC9, pPiC3.5, pHlL-D2, pHIL-Sl, 

15 pPIC3.5K, pPIC9K, and PA0815 (all available from Invitrogen, Carlbad, CA). Other 
suitable vectors will be readily apparent to the skilled artisan. 

Introduction of the construct into the host cell can be effected by calcium phosphate 
transfection, DEAE-dextran mediated transfection, cationic lipid-mediated transfection, 
electroporation, transduction, infection, or other methods. Such methods are described in 

20 many standard laboratory manuals, such as Davis et al., Basic Methods In Molecular Biology 
(1986). It is specifically contemplated that the polypeptides of the present invention may in 
fact be expressed by a host cell lacking a recombinant vector. 

A polypeptide of this invention can be recovered and purified from recombinant cell 
cultures by well-known methods including ammonium sulfate or ethanol precipitation, acid 

25 extraction, anion or cation exchange chromatography, phosphocellulose chromatography, 
hydrophobic interaction chromatography, affinity chromatography, hydroxylapatite 
chromatography and lectin chromatography. Most preferably, high performance liquid 
chromatography ("HPLC") is employed for purification. 

Polypeptides of the present invention can also be recovered from: products purified 

30 from natural sources, including bodily fluids, tissues and cells, whether directly isolated or 
cultured; products of chemical synthetic procedures; and products produced by recombinant 
techniques from a prokaryotic or eukaryotic host, including, for example, bacterial, yeast, 
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higher plant, insect, and inammalian cells. Depending upon the host employed in a 
recombinant production procedure, the polypeptides of the present invention may be 
glycosylated or may be non-glycosylated. In addition, polypeptides of the invention may 
also include an initial modified methionine residue, in some cases as a result of host- 
5 mediated processes. Thus, it is well known in the art that the N-terminal methionine encoded 
by the translation initiation codon generally is removed with high efficiency from any protein 
after translation in all eukaryotic cells. While the N-terminal methionine on most proteins 
also is efficiently removed in most prokaryotes, for some proteins, this prokaryotic removal 
process is inefficient, depending on the nature of the amino acid to which the N-terminal 

10 methionine is covalently linked. 

In one embodiment, the yeast Pichia pastoris is used to express polypeptides of the 
invention in a eukaryotic system. Pichia pastoris is a methylotrophic yeast which can 
metabolize methanol as its sole carbon source. A main step in the methanol metabolization 
pathway is the oxidation of methanol to formaldehyde using O?. This reaction is catalyzed 

15 by the enzyme alcohol oxidase. In order to metabolize methanol as its sole carbon source, 
Pichia pastoris must generate high levels of alcohol oxidase due, in part, to the relatively low 
affinity of alcohol oxidase for O?. Consequently, in a growth medium depending on 
methanol as a main carbon source, the promoter region of one of the two alcohol oxidase 
genes (AOXl) is highly active. In the presence of methanol, alcohol oxidase produced from 

2Q the AOXl gene comprises up to approximately 30% of the total soluble protein in Pichia 
pastoris. See, Ellis, S.B., et ai, MoL Cell BioL 5:111 1-21 (1985); Koutz, P.J, et aL Yeast 
5:167-77 (1989); Tschopp, J.F., et aL. Nuci Acids Res. 15:3859-76 (1987). Thus, a 
heterologous coding sequence, such as, for example, a polynucleotide of the present 
invention, under the transcriptional regulation of all or part of the AOXl regulatory sequence 

25 is expressed at exceptionally high levels in Pichia yeast grown in the presence of methanol. 

In one example, the plasmid vector pPlC9K is used to express DNA encoding a 
polypeptide of the invention, as set forth herein, in a Pichea yeast system essentially as 
described in ''Pichia Protocols: Methods in Molecular Biology," D.R. Higgins and J. Cregg, 
eds. The Humana Press, Totowa, NJ, 1998. This expression vector allows expression and 

3Q secretion of a polypeptide of the invention by virtue of the strong AOXl promoter linked to 
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the Pichia pcistoris alkaline phosphatase (PHO) secretory signal peptide (i.e,, leader) located 
upstream of a multiple cloning site. 

Many other yeast vectors could be used in place of pPIC9K, such as, pYES2, pYDl, 
pTEFl/Zeo, pYES2/GS, pPlCZ, pGAPZ, pGAPZalpha, pPlC9, pPIC3.5, pHIL-D2, pHIL-Sl, 
5 pPIC3.5K, and PA0815, as one skilled in the art would readily appreciate, as long as the 
proposed expression construct provides appropriately located signals for transcription, 
translation, secretion (if desired), and the like, including an in-frame AUG as required. 

In another embodiment, high-level expression of a heterologous coding sequence, 
such as, for example, a polynucleotide of the present invention, may be achieved by cloning 

10 the heterologous polynucleotide of the invention into an expression vector such as, for 
example, pGAPZ or pGAPZalpha, and growing the yeast culture in the absence of methanol. 

In addition to encompassing host cells containing the vector constructs discussed 
herein, the invention also encompasses primary, secondary, and immortalized host cells of 
vertebrate origin, particularly mammalian origin, that have been engineered to delete or 

15 replace endogenous genetic material (e.g., coding sequence), and/or to include genetic 
material (e.g., heterologous polynucleotide sequences) that is operably associated with 
polynucleotides of the invention, and which activates, alters, and/or amplifies endogenous 
polynucleotides. For example, techniques known in the art may be used to operably associate 
heterologous control regions (e.g., promoter and/or enhancer) and endogenous 

20 polynucleotide sequences via homologous recombination (see, e.g., U.S. Patent No. 
5,641,670, issued June 24, 1997; International Publication No. WO 96/29411, published 
September 26, 1996; International Publication No. WO 94/12650, published August 4, 1994; 
Roller et al., Proc. Natl. Acad. Sci. USA 86:8932-8935 (1989); and Zijlstra et al.. Nature 
342:435-438 (1989), the disclosures of each of which are incorporated by reference in their 

25 entireties). 

In addition, polypeptides of the invention can be chemically synthesized using 
techniques known in the art (e.g., see Creighton, 1983, Proteins: Structures and Molecular 
Principles, W.H. Freeman & Co., N.Y., and Hunkapiller et ah, Nature, 310:105-1 1 1 (1984)). 
For example, a polypeptide corresponding to a fragment of a polypeptide can be synthesized 
30 by use of a peptide synthesizer. Furthermore, if desired, nonclassical amino acids or 
chemical amino acid analogs can be introduced as a substitution or addition into the 
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polypeptide sequence. Non-classical amino acids include, but are not limited to, to the D- 
isomers of the common amino acids, 2,4-diaminobutyric acid, a-amino isobutyric acid, 4- 
aminobutyric acid, Abu, 2-amino butyric acid, g-Abu, e-Ahx, 6-amino hexanoic acid, Aib, 
2-amino isobutyric acid, 3-amino propionic acid, ornithine, norleucine, norvaline, 
5 hydroxyproline, sarcosine, citrulline, homocitrulline, cysteic acid, t-butylglycine, t- 
butylalanine, phenylglycine, cyclohexylalanine, b-alanine, fluoro-amino acids, designer 
amino acids such as b-methyl amino acids, Ca-methyl amino acids, Na-methyl amino acids, 
and amino acid analogs in general. Furthermore, the amino acid can be D (dextrorotary) or L 
(levorotary). 

10 Non-naturally occurring variants may be produced using art-known mutagenesis 

techniques, which include, but are not limited to oligonucleotide mediated mutagenesis, 
alanine scanning, PGR mutagenesis, site directed mutagenesis (see, e.g., Carter et ai, Nucl 
Acids Res. 75:4331 (1986); and Zoller et ai, Nucl Acids Res. 70:6487 (1982)), cassette 
mutagenesis (see, e.g., Wells et ai, Gene i^:315 (1985)), restriction selection mutagenesis 

15 (see, eg, Wells et ai, Philos. Trans. R. Soc. London SerA 317:415 (1986)). 

The invention additionally, encompasses polypeptides of the present invention which 
are differentially modified during or after translation, e.g., by glycosylation, acetylation, 
phosphorylation, amidation, derivatization by known protecting/blocking groups, proteolytic 
cleavage, linkage to an antibody molecule or other cellular ligand, etc. Any of numerous 

20 chemical modifications may be carried out by known techniques, including but not limited, to 
specific chemical cleavage by cyanogen bromide, trypsin, chymotrypsin, papain, V8 
protease, NaBHa; acetylation, formylation, oxidation, reduction; metabolic synthesis in the 
presence of tunicamycin; etc. 

Additional post-translational modifications encompassed by the invention include, for 

25 example, e.g., N-linked or 0-linked carbohydrate chains, processing of N-terminal or 
C-terminal ends), attachment of chemical moieties to the amino acid backbone, chemical 
modifications of N-linked or 0-linked carbohydrate chains, and addition or deletion of an 
N-terminal methionine residue as a result of procaryotic host cell expression. The 
polypeptides may also be modified with a detectable label, such as an enzymatic, fluorescent, 

30 isotopic or affinity label to allow for detection and isolation of the protein. 

Also provided by the invention are chemically modified derivatives of the 
polypeptides of the invention which may provide additional advantages such as increased 
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solubility, stability and circulating time of the polypeptide, or decreased immunogenicity (see 
U.S. Patent No. 4,179,337). The chemical moieties for derivitization may be selected from 
water soluble polymers such as polyethylene glycol, ethylene glycol/propylene glycol 
copolymers, carboxymethylcellulose, dextran, polyvinyl alcohol and the like. The 
5 polypeptides may be modified at random positions within the molecule, or at predetermined 
positions within the molecule and may include one, two, three or more attached chemical 
moieties. 

The polymer may be of any molecular weight, and may be branched or unbranched. 

For polyethylene glycol, the preferred molecular weight is between about I kDa and about 
10 100 kDa (the term "about" indicating that in preparations of polyethylene glycol, some 

molecules will weigh more, some less, than the stated molecular weight) for ease in handling 

and manufacturing. Other sizes may be used, depending on the desired therapeutic profile 

(e.g., the duration of sustained release desired, the effects, if any on biological activity, the 

ease in handling, the degree or . lack of antigenicity and other known effects of the 
15 polyethylene glycol to a therapeutic protein or analog). For example, the polyethylene glycol 

may have an average molecular weight of about 200; 500; 1000; 1500; 2000; 2500; 3000; 

3500; 4000; 4500; 5000; 5500; 6000; 6500; 7000; 7500; 8000; 8500; 9000; 9500; 10,000; 

10,500; 11,000; 11,500; 12,000; 12,500; 13,000; 13,500; 14,000; 14,500; 15,000; 15,500; 

16,000; 16,500; 17,000; 17,500; 18,000; 18,500; 19,000; 19,500; 20,000; 25,000; 30,000; 
20 35,000; 40,000; 50,000; 55,000; 60,000; 65,000; 70,000; 75,000; 80,000; 85,000; 90,000; 

95,000; or 100,000 kDa. 

As noted above, the polyethylene glycol may have a branched structure. Branched 

polyethylene glycols are described, for example, in U.S. Patent No. 5,643,575; Morpurgo et 

aL, AppL Biochem. BiotechnoL 56:59-72 (1996); Vorobjev et al. Nucleosides Nucleotides 
25 75:2745-2750 (1999); and Caliceti et al, Bioconjug, Chem. 70:638-646 (1999), the 

disclosures of each of which are incorporated herein by reference. 

The polyethylene glycol molecules (or other chemical moieties) should be attached to 

the protein with consideration of effects on functional or antigenic domains of the protein. 

There are a number of attachment methods available to those skilled in the art, e.g., EP 0 401 
30 384, herein incorporated by reference (coupling PEG to G-CSF), see also Malik et al., Exp. 

Hematol. 20:1028-1035 (1992) (reporting pegylation of GM-CSF using tresyl chloride). For 

example, polyethylene glycol may be covalently bound through amino acid residues via a 
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reactive group, such as, a free amino or carboxyl group. Reactive groups are those to which 
an activated polyethylene glycol molecule may be bound. The amino acid residues having a 
free amino group may include lysine residues and the N-terminal amino acid residues; those 
having a free carboxyl group may include aspartic acid residues glutamic acid residues and 
5 the C-terminal amino acid residue. Sulfliydryl groups may also be used as a reactive group 
for attaching the polyethylene glycol molecules. Preferred for therapeutic purposes is 
attachment at an amino group, such as attachment at the N-terminus or lysine group. 

As suggested above, polyethylene glycol may be attached to proteins via linkage to 
any of a number of amino acid residues. For example, polyethylene glycol can be linked to a 

10 proteins via covalent bonds to lysine, histidine, aspartic acid, glutamic acid, or cysteine 
residues. One or more reaction chemistries may be employed to attach polyethylene glycol 
to specific amino acid residues (e.g., lysine, histidine, aspartic acid, glutamic acid, or 
cysteine) of the protein or to more than one type of amino acid residue (e.g., lysine, histidine, 
aspartic acid, glutamic acid, cysteine and combinations thereof) of the protein. 

15 One may specifically desire proteins chemically modified at the N-terminus. Using 

polyethylene glycol as an illustration of the present composition, one may select from a 
variety of polyethylene glycol molecules (by molecular v^eight, branching, etc.), the 
proportion of polyethylene glycol molecules to protein (polypeptide) molecules in the 
reaction mix, the type of pegylation reaction to be performed, and the method of obtaining 

20 the selected N-terminally pegylated protein. The method of obtaining the N-terminally 
pegylated preparation (i.e., separating this moiety from other monopegylated moieties if 
necessary) may be by purification of the N-terminally pegylated material from a population 
of pegylated protein molecules. Selective proteins chemically modified at the N-terminus 
modification may be accomplished by reductive alkylation v^^hich exploits differential 

25 reactivity of different types of primary amino groups (lysine versus the N-terminal) available 
for derivatization in a particular protein. Under the appropriate reaction conditions, 
substantially selective derivatization of the protein at the N-terminus with a carbonyl group 
containing polymer is achieved. 

As indicated above, pegylation of the proteins of the invention may be accomplished 

30 by any number of means. For example, polyethylene glycol may be attached to the protein 
either directly or by an intervening linker. Linkerless systems for attaching polyethylene 
glycol to proteins are described in Delgado et al, Crit. Rev. Thera. Drag Carrier Sys. 9:249- 
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304 (1992); Francis etai. Intern. J, ofHematoL 55:1-18 (1998); U.S. Patent No. 4,002,531; 
U.S. Patent No. 5,349,052; WO 95/06058; and WO 98/32466, the disclosures of each of 
which are incorporated herein by reference. 

One system for attaching polyethylene glycol directly to amino acid residues of 
5 proteins without an intervening linker employs tresylated MPEG, which is produced by the 
modification of monmethoxy polyethylene glycol (MPEG) using tresylchloride 
(CISO2CH2CF3). Upon reaction of protein with tresylated MPEG, polyethylene glycol is 
directly attached to amine groups of the protein. Thus, the invention includes protein- 
polyethylene glycol conjugates produced by reacting proteins of the invention with a 
10 polyethylene glycol molecule having a 2,2,2-trifluoreothane sulphonyl group. 

Polyethylene glycol can also be attached to proteins using a number of different 
intervening linkers. For example, U.S. Patent No. 5,612,460, the entire disclosure of which is 
incorporated herein by reference, discloses urethane linkers for connecting polyethylene 
glycol to proteins. Protein-polyethylene glycol conjugates wherein the polyethylene glycol is 

15 attached to the protein by a linker can also be produced by reaction of proteins with 
compounds such as MPEG-succinimidylsuccinate, MPEG activated with 
1, T- carbon y Id i imidazole, MPEG-2,4,5-trichloropenylcarbonate, MPEG-p- 
nitrophenolcarbonate, and various MPEG-succinate derivatives. A number additional 
polyethylene glycol derivatives and reaction chemistries for attaching polyethylene glycol to 

20 proteins are described in WO 98/32466, the entire disclosure of which is incorporated herein 
by reference. Pegylated protein products produced using the reaction chemistries set out 
herein arc included within the scope of the invention. 

The number of polyethylene glycol moieties attached to each protein of the invention 
(i.e., the degree of substitution) may also vary. For example, the pegylated proteins of the 

25 invention may be linked, on average, to 1,2, 3, 4, 5, 6, 7, 8, 9, 10, 12, 15, 17, 20, or more 
polyethylene glycol molecules. Similarly, the average degree of substitution within ranges 
such as 1-3, 2-4, 3-5, 4-6, 5-7, 6-8, 7-9, 8-10, 9-1 1, 10-12, 1 1-13, 12-14, 13-15, 14-16, 15-17, 
16-18, 17-19, or 18-20 polyethylene glycol moieties per protein molecule. Methods for 
determining the degree of substitution are discussed, for example, in Delgado et al, Crit, 

30 Rev. Thera. Drug Carrier Sys. 9:249-304 (1992). 

The breast/ovarian cancer antigen polypeptides of the invention may be in monomers 
or multimers (i.e., dimers, trimers, tetramers and higher multimers). Accordingly, the present 
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invention relates to monomers and multimers of the polypeptides of the invention, their 
preparation, and compositions (preferably, Therapeutics) containing them. In specific 
embodiments, the polypeptides of the invention are monomers, dimers, trimers or tetramers. 
In additional embodiments, the multimers of the invention are at least dimers, at least trimers, 
5 or at least tetramers. 

Multimers encompassed by the invention may be homomers or heteromers. As used 
herein, the term homomer, refers to a multimer containing only polypeptides corresponding 
to the amino acid sequence of SEQ ID NO:Y or an amino acid sequence encoded by SEQ ID 
NO:X, and/or an amino acid sequence encoded by the cDNA in a related cDNA clone 

10 contained in a deposited library (including fragments, variants, splice variants, and fusion 
proteins, corresponding to any one of these as described herein). These homomers may 
contain polypeptides having identical or different amino acid sequences. In a specific 
embodiment, i homomer of the invention is a multimer containing only polypeptides having 
an identical amino acid sequence. In another specific embodiment, a homomer of the 

15 invention is a multimer containing polypeptides having different amino acid sequences. In 
specific embodiments, the multimer of the invention is a homodimer (e.g., containing 
polypeptides having identical or different amino acid sequences) or a homotrimer (e.g., 
containing polypeptides having identical and/or different amino acid sequences). In 
additional embodiments, the homomeric multimer of the invention is at least a homodimer, at 

20 least a homotrimer, or at least a homotetramer. 

As used herein, the term heteromer refers to a multimer containing one or more 
heterologous polypeptides (i.e., polypeptides of different proteins) in addition to the 
polypeptides of the invention. In a specific embodiment, the multimer of the invention is a 
heterodimer, a heterotrimer, or a heterotetramer. In additional embodiments, the heteromeric 

25 multimer of the invention is at least a heterodimer, at least a heterotrimer, or at least a 
heterotetramer. 

Multimers of the invention may be the result of hydrophobic, hydrophilic, ionic 
and/or covalent associations and/or may be indirectly linked, by for example, liposome 
formation. Thus, in one embodiment, multimers of the invention, such as, for example, 
30 homodimers or homotrimers, are formed when polypeptides of the invention contact one 
another in solution. In another embodiment, heteromultimers of the invention, such as, for 
example, heterotrimers or heterotetramers, are formed when polypeptides of the invention 
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contact antibodies to the polypeptides of the invention (including antibodies to the 
heterologous polypeptide sequence in a fusion protein of the invention) in solution. In other 
embodiments, multimers of the invention are formed by covalent associations with and/or 
between the polypeptides of the invention. Such covalent associations may involve one or 
5 more amino acid residues contained in the polypeptide sequence (e.g., that recited in SEQ ID 
NO:Y, or contained in a polypeptide encoded by SEQ ID NO:X, and/or by the cDNA in the 
related cDNA clone contained in a deposited library). In one instance, the covalent 
associations are cross-linking between cysteine residues^ located within the polypeptide 
sequences which interact in the native (i.e., naturally occurring) polypeptide. In another 

10 instance, the covalent associations are the consequence of chemical or recombinant 
manipulation. Alternatively, such covalent associations may involve one or more amino acid 
residues contained in the heterologous polypeptide sequence in a fusion protein. In one 
example, covalent associations are between the heterologous sequence contained in a fusion 
protein of the invention (see, e.g., US Patent Number 5,478,925). in a specific example, the 

15 covalent associations are between the heterologous sequence contained in a Fc fusion protein 
of the invention (as described herein). In another specific example, covalent associations of 
fusion proteins of the invention are between heterologous polypeptide sequence from another 
protein that is capable of forming covalently associated multimers, such as for example, 
oseteoprotegerin (see, e.g., International Publication NO: WO 98/49305, the contents of 

20 which are herein incorporated by reference in its entirety). In another embodiment, two or 
more polypeptides of the invention are joined through peptide linkers. Examples include 
those peptide linkers described in U.S. Pat. No. 5,073,627 (hereby incorporated by 
reference). Proteins comprising multiple polypeptides of the invention separated by peptide 
linkers may be produced using conventional recombinant DNA technology. 

25 Another method for preparing multimer polypeptides of the invention involves use of 

polypeptides of the invention ftised to a leucine zipper or isoleucine zipper polypeptide 
sequence. Leucine zipper and isoleucine zipper domains are polypeptides that promote 
multimerization of the proteins in which they are found. Leucine zippers were originally 
identified in several DNA-binding proteins (Landschulz et al., Science 240:1759, (1988)), 

30 and have since been found in a variety of different proteins. Among the known leucine 
zippers are naturally occurring peptides and derivatives thereof that dimerize or trimerize. 
Examples of leucine zipper domains suitable for producing soluble multimeric proteins of the 
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invention are those described in PCT application WO 94/10308, hereby incorporated by 
reference. Recombinant fusion proteins comprising a polypeptide of the invention fused to a 
polypeptide sequence that dimerizes or trimerizes in solution are expressed in suitable host 
cells, and the resulting soluble multimeric fusion protein is recovered from the culture 
5 supernatant using techniques known in the art. 

Trimeric polypeptides of the invention may offer the advantage of enhanced 
biological activity. Preferred leucine zipper moieties and isoleucine moieties are those that 
preferentially form trimers. One example is a leucine zipper derived from lung surfactant 
protein D (SPD), as described in Hoppe et al. (FEBS Letters 344:191, (1994)) and in U.S. 

10 patent application Ser. No. 08/446,922, hereby incorporated by reference. Other peptides 
derived from naturally occurring trimeric proteins may be employed in preparing trimeric 
polypeptides of the invention. 

In another example, proteins of the invention are associated by interactions between 
Flag® polypeptide sequence contained in fusion proteins of the invention containing Flag® 

15 polypeptide seuqence. In a further embodiment, associations proteins of the invention are 
associated by interactions between heterologous polypeptide sequence contained in Flag® 
fusion proteins of the invention and anti-Flag® antibody. 

The multimers of the invention may be generated using chemical techniques known in 
the art. For example, polypeptides desired to be contained in the multimers of the invention 

20 may be chemically cross-linked using linker molecules and linker molecule length 
optimization techniques known in the art (see, e.g., US Patent Number 5,478,925, which is 
herein incorporated by reference in its entirety). Additionally, multimers of the invention 
may be generated using techniques known in the art to form one or more inter-molecule 
cross-links between the cysteine residues located within the sequence of the polypeptides 

25 desired to be contained in the multimer (see, e.g., US Patent Number 5,478,925, which is 
herein incorporated by reference in its entirety). Further, polypeptides of the invention may 
be routinely modified by the addition of cysteine or biotin to the C-terminus or N-terminus of 
the polypeptide and techniques known in the art may be applied to generate multimers 
containing one or more of these modified polypeptides (see, e.g., US Patent Number 

30 5,478,925, which is herein incorporated by reference in its entirety). Additionally, 
techniques known in the art may be applied to generate liposomes containing the polypeptide 
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components desired to be contained in the multimer of the invention (see, e.g., US Patent 
Number 5,478,925, which is herein incorporated by reference in its entirety). 

Alternatively, multimers of the invention may be generated using genetic engineering 
techniques known in the art. In one embodiment, polypeptides contained in multimers of the 
5 invention are produced recombinantly using fusion protein technology described herein or 
otherwise known in the art (see, e.g., US Patent Number 5,478,925, which is herein 
incorporated by reference in its entirety). In a specific embodiment, polynucleotides coding 
for a homodimer of the invention are generated by ligating a polynucleotide sequence 
encoding a polypeptide of the invention to a sequence encoding a linker polypeptide and then 

10 further to a synthetic polynucleotide encoding the translated product of the polypeptide in the 
reverse orientation from the original C-terminus to the N-terminus (lacking the leader 
sequence) (see, e.g., US Patent Number 5,478,925, which is herein incorporated by reference 
in its entirety). In another embodiment, recombinant techniques described herein or 
otherwise known in the art are applied to generate recombinant polypeptides of the invention 

15 which contain a transmembrane domain (or hyrophobic or signal peptide) and which can be 
incorporated by membrane reconstitution techniques into liposomes (see, e.g., US Patent 
Number 5,478,925, which is herein incorporated by reference in its entirety). 



Antibodies 

20 Further polypeptides of the invention relate to antibodies and T-cell antigen receptors 

(TCR) which immunospecifically bind a polypeptide, polypeptide fragment, or variant of 
SEQ ID NO:Y, and/or an epitope, of the present invention (as determined by immunoassays 
well known in the art for assaying specific antibody-antigen binding). Antibodies of the 
invention include, but are not limited to, polyclonal, monoclonal, multispecific, human, 

25 humanized or chimeric antibodies, single chain antibodies, Fab fragments, F(ab') fragments, 
fragments produced by a Fab expression library, anti-idiotypic (anti-Id) antibodies 
(including, e.g., anti-Id antibodies to antibodies of the invention), and epitope-binding 
fragments of any of the above. The term "antibody," as used herein, refers to 
immunoglobulin molecules and immunologically active portions of immunoglobulin 

30 molecules, i.e., molecules that contain an antigen binding site that immunospecifically binds 
an antigen. The immunoglobulin molecules of the invention can be of any type (e.g., IgG, 
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IgE, IgM, IgD, IgA and IgY), class (e.g., IgGl, IgG2, lgG3, IgG4, IgAl and IgA2) or 
subclass of immunoglobulin molecule. 

Most preferably the antibodies are human antigen-binding antibody fragments of the 
present invention and include, but are not limited to, Fab, Fab' and F(ab')2, Fd, single-chain 
5 Fvs (scFv), single-chain antibodies, disulfide-linked Fvs (sdFv) and fragments comprising 
either a VL or VH domain. Antigen-binding antibody fragments, including single-chain 
antibodies, may comprise the variable region(s) alone or in combination with the entirety or a 
portion of the following: hinge region, CHI, CH2, and CH3 domains. Also included in the. 
invention are antigen-binding fragments also comprising any combination of variable 

10 region(s) with a hinge region, CHI , CH2, and CH3 domains. The antibodies of the invention 
may be from any animal origin including birds and mammals. Preferably, the antibodies are 
human, murine (e.g., mouse and rat), donkey, ship rabbit, goat, guinea pig, camel, horse, or 
chicken. As used herein, "human" antibodies include antibodies having the amino acid 
sequence of a human immunoglobulin and include antibodies isolated from human 

15 immunoglobulin libraries or from animals transgenic for one or more human 
immunoglobulin and that do not express endogenous immunoglobulins, as described infra 
and, for example in, U.S. Patent No. 5,939,598 by Kucherlapati et al. 

The antibodies of the present invention may be monospecific, bispecific, trispecific or 
of greater multispecificity. Multispecific antibodies may be specific for different epitopes of 

20 a polypeptide of the present invention or may be specific for both a polypeptide of the present 
invention as well as for a heterologous epitope, such as a heterologous polypeptide or solid 
support material. See, e.g., PCT publications WO 93/17715; WO 92/08802; WO 91/00360; 
WO 92/05793; Tutt, et al., J. Immunol. 147:60-69 (1991); U.S. Patent Nos. 4,474,893; 
4,714,681; 4,925,648; 5,573,920; 5,601,819; Kostelny et al., J. Immunol. 148:1547-1553 

25 (1992). 

Antibodies of the present invention may be described or specified in terms of the 
epitope(s) or portion(s) of a polypeptide of the present invention which they recognize or 
specifically bind. The epitope(s) or polypeptide portion(s) may be specified as described 
herein, e.g., by N-terminal and C-terminal positions, or by size in contiguous amino acid 
30 residues. Antibodies which specifically bind any epitope or polypeptide of the present 
invention may also be excluded. Therefore, the present invention includes antibodies that 
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specifically bind polypeptides of the present invention, and allows for the exclusion of the 
same. 

Antibodies of the present invention may also be described or specified in terms of 
their cross-reactivity. Antibodies that do not bind any other analog, ortholog, or homolog of 
5 a polypeptide of the present invention are included. Antibodies that bind polypeptides with 
at least 95%, at least 90%, at least 85%, at least 80%, at least 75%, at least 70%, at least 65%, 
at least 60%, at least 55%, and at least 50% identity (as calculated using methods known in 
the art and described herein) to a polypeptide of the present invention are also included in the 
present invention. In specific embodiments, antibodies of the present invention cross-react 

10 with murine, rat and/or rabbit homologs of human proteins and the corresponding epitopes 
thereof Antibodies that do not bind polypeptides with less than 95%, less than 90%, less than 
85%, less than 80%, less than 75%, less than 70%, less than 65%, less than 60%, less than 
55%, and less than 50% identity (as calculated using methods known in the art and described 
herein) to a polypeptide of the present invention are also included in the present invention. 

15 In a specific embodiment, the above-described cross-reactivity is with respect to any single 
specific antigenic or immunogenic polypeptide, or combination(s) of 2, 3, 4, 5, or more of the 
specific antigenic and/or immunogenic polypeptides disclosed herein. Further included in the 
present invention are antibodies which bind polypeptides encoded by polynucleotides which 
hybridize to a polynucleotide of the present invention under stringent hybridization 

20 conditions (as described herein). Antibodies of the present invention may also be described 
or specified in terms of their binding affinity to a polypeptide of the invention. Preferred 
binding affinities include those with a dissociation constant or Kd less than 5X10"^ M, 10'^ 
M, 5 X 10'^ M, 10'^ M, 5 X 10*^ M, 10'^ M, 5 X 10'^ M, 10'^ M, 5 X 10'^ M, lO'^M, 5 X lO"^ 
M, 10^ M, 5 X 10'^ M, 10'^M,5X lO'^M, lO'^M, 5 X 10'*^ M, I0-'^M,5 X lO^** M, 10"^^ 

25 M, 5 X 10-'^ M, '^-^^ M, 5 X 10"^^ M, 10'*^ M, 5 X 10'*^ M, 10*^^ M, 5 X lO'*^ M, or '^'^^ M. 

The invention also provides antibodies that competitively inhibit binding of an 
antibody to an epitope of the invention as determined by any method known in the art for 
determining competitive binding, for example, the immunoassays described herein. In 
preferred embodiments, the antibody competitively inhibits binding to the epitope by at least 

30 95%, at least 90%, at least 85 %, at least 80%, at least 75%, at least 70%, at least 60%, or at 
least 50%. 
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Antibodies of the present invention may act as agonists or antagonists of the 
polypeptides of the present invention. For example, the present invention includes antibodies 
which disrupt the receptor/ligand interactions with the polypeptides of the invention either 
partially or fully. Preferrably, antibodies of the present invention bind an antigenic epitope 
5 disclosed herein, or a portion thereof. The invention features both receptor-specific 
antibodies and iigand-specific antibodies. The invention also features receptor-specific 
antibodies which do not prevent ligand binding but prevent receptor activation. Receptor 
activation (i.e., signaling) may be determined by techniques described herein or otherwise 
known in the art. For example, receptor activation can be determined by detecting the 

10 phosphorylation (e.g., tyrosine or serine/threonine) of the receptor or its substrate by 
immunoprecipitation followed by western blot analysis (for example, as described supra). In 
specific embodiments, antibodies are provided that inhibit ligand activity or receptor activity 
by at least 95%, at least 90%, at least 85%, at least 80%, at least 75%, at least 70%, at least 
60%, or at least 50% of the activity in absence of the antibody. 

15 The invention also features receptor-specific antibodies which both prevent ligand 

binding and receptor activation as well as antibodies that recognize the receptor-ligand 
complex, and, preferably, do not specifically recognize the unbound receptor or the unbound 
ligand. Likewise, included in the invention are neutralizing antibodies which bind the ligand 
and prevent binding of the ligand to the receptor, as well as antibodies which bind the ligand, 

20 thereby preventing receptor activation, but do not prevent the ligand from binding the 
receptor. Further included in the invention are antibodies which activate the receptor. These 
antibodies may act as receptor agonists, i.e., potentiate or activate either all or a subset of the 
biological activities of the ligand-mediated receptor activation, for example, by inducing 
dimerization of the receptor. The antibodies may be specified as agonists, antagonists or 

25 inverse agonists for biological activities comprising the specific biological activities of the 
peptides of the invention disclosed herein. The above antibody agonists can be made using 
methods known in the art. See, e.g., PCT publication WO 96/40281; U.S. Patent No. 
5,811,097; Deng et al.. Blood 92(6): 1981-1988 (1998); Chen et al.. Cancer Res. 
58(16):3668-3678 (1998); Harrop et al., J. Immunol. 161(4): 1786-1794 (1998); Zhu et al., 

30 Cancer Res. 58(15):3209-3214 (1998); Yoon et al., J. Immunol. 160(7):3 170-3179 (1998); 
Prat et al., J. Cell. Sci. 11 l(Pt2):237-247 (1998); Pitard et al., J. Immunol. Methods 
205(2): 1 77- 190 (1997); Liautard et al.. Cytokine 9(4):233-24l (1997); Carlson et al., J. Biol. 



1 
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Chem. 272(17):! 1295-1 1301 (1997); Taryman et al., Neuron 14(4):755-762 (1995); Muller 
et al-, Structure 6(9):1 153-1 167 (1998); Bartunek et al., Cytokine 8(l):14-20 (1996) (which 
are all incorporated by reference herein in their entireties). 

Antibodies of the present invention may be used, for example, but not limited to, to 
5 purify, detect, and target the polypeptides of the present invention, including both in vitro and 
in vivo diagnostic and therapeutic methods. For example, the antibodies have use in 
immunoassays for qualitatively and quantitatively measuring levels of the polypeptides of the 
present invention in biological samples. See, e.g., Harlow et al., Antibodies: A Laboratory 
Manual, (Cold Spring Harbor Laboratory Press, 2nd ed. 1988) (incorporated by reference 

10 herein in its entirety). 

As discussed in more detail below, the antibodies of the present invention may be 
used either alone or in combination with other compositions. The antibodies may further be 
recombinantly fused to a heterologous polypeptide at the N- or C-terminus or chemically 
conjugated (including covalently and non-covalently conjugations) to polypeptides or other 

15 compositions. For example, antibodies of the present invention may be recombinantly fused 
or conjugated to molecules useful as labels in detection assays and effector molecules such as 
heterologous polypeptides, drugs, radionuclides, or toxins. See, e.g., PCT publications WO 
92/08495; WO 91/14438; WO 89/12624; U.S. Patent No. 5,314,995; and EP 396,387. 

The antibodies of the invention include derivatives that are modified, i.e, by the 

20 covalent attachment of any type of molecule to the antibody such that covalent attachment 
does not prevent the antibody from generating an anti-idiotypic response. For example, but 
not by way of limitation, the antibody derivatives include antibodies that have been modified, 
e.g., by glycosylation, acetylation, pegylation, phosphylation, amidation, derivatization by 
known protecting/blocking groups, proteolytic cleavage, linkage to a cellular ligand or other 

25 protein, etc. Any of numerous chemical modifications may be carried out by known 
techniques, including, but not limited to specific chemical cleavage, acetylation, formylation, 
metabolic synthesis of tunicamycin, etc. Additionally, the derivative may contain one or 
more non-classical amino acids. 

The antibodies of the present invention may be generated by any suitable method 

30 known in the art. Polyclonal antibodies to an antigen-of- interest can be produced by various 
procedures well known in the art. For example, a polypeptide of the invention can be 
administered to various host animals including, but not limited to, rabbits, mice, rats, etc. to 
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induce the production of sera containing polyclonal antibodies specific for the antigen. 
Various adjuvants may be used to increase the immunological response, depending on the 
host species, and include but are not limited to, Freund*s (complete and incomplete), mineral 
gels such as aluminum hydroxide, surface active substances such as lysolecithin, pluronic 
5 polyols, polyanions, peptides, oil emulsions, keyhole hmpet hemocyanins, dinitrophenol, and 
potentially useful human adjuvants such as BCG (bacille Calmette-Guerin) and 
corynebacterium parvum. Such adjuvants are also well known in the art. 

Monoclonal antibodies can be prepared using a wide variety of techniques known in 
the art including the use of hybridoma, recombinant, and phage display technologies, or a 

10 combination thereof. For example, monoclonal antibodies can be produced using hybridoma 
techniques including those known in the art and taught, for example, in Harlow et al.. 
Antibodies: A Laboratory Manual, (Cold Spring Harbor Laboratory Press, 2nd ed. 1988); 
Hammerling, et al., in: Monoclonal Antibodies and T-Cell Hybridomas 563-681 (Elsevier, 
N.Y., 1981) (said references incorporated by reference in their entireties). The term 

15 "monoclonal antibody" as used herein is not limited to antibodies produced through 
hybridoma technology. The term "monoclonal antibody" refers to an antibody that is 
derived from a single clone, including any eukaryotic, prokaryotic, or phage clone, and not 
the method by which it is produced. 

Methods for producing and screening for specific antibodies using hybridoma 

20 technology are routine and well known in the art and are discussed in detail in the Examples. 
In a non-limiting example, mice can be immunized with a polypeptide of the invention or a 
cell expressing such peptide. Once an immune response is detected, e.g., antibodies specific 
for the antigen are detected in the mouse serum, the mouse spleen is harvested and 
splenocytes isolated. The splenocytes are then fused by well known techniques to any 

25 suitable myeloma cells, for example cells from cell line SP20 available from the ATCC. 
Hybridomas are selected and cloned by limited dilution. The hybridoma clones are then 
assayed by methods known in the art for cells that secrete antibodies capable of binding a 
polypeptide of the invention. Ascites fluid, which generally contains high levels of 
antibodies, can be generated by immunizing mice with positive hybridoma clones. 

30 Accordingly, the present invention provides methods of generating monoclonal 

antibodies as well as antibodies produced by the method comprising culturing a hybridoma 
cell secreting an antibody of the invention wherein, preferably, the hybridoma is generated by 
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fusing splenocytes isolated from a mouse immunized with an antigen of the invention with 
myeloma cells and then screening the hybridomas resulting from the fusion for hybridoma 
clones that secrete an antibody able to bind a polypeptide of the invention. 

Antibody fragments which recognize specific epitopes may be generated by known 
5 techniques. For example, Fab and F(ab')2 fragments of the invention may be produced by 
proteolytic cleavage of immunoglobulin molecules, using enzymes such as papain (to 
produce Fab fragments) or pepsin (to produce F(ab')2 fragments). F(ab')2 fragments contain 
the variable region, the light chain constant region and the CHI domain of the heavy chain. 

10 For example, the antibodies of the present invention can also be generated using 

various phage display methods known in the art. In phage display methods, functional 
antibody domains are displayed on the surface of phage particles which carry the 
polynucleotide sequences encoding them. In a particular embodiment, such phage can be 
utilized to display antigen binding domains expressed from a repertoire or combinatorial 

15 antibody library (e.g., human or murine). Phage expressing an antigen binding domain that 
binds the antigen of interest can be selected or identified with antigen, e.g., using labeled 
antigen or antigen bound or captured to a solid surface or bead. Phage used in these methods 
are typically filamentous phage including fd and*M13 binding domains expressed from phage 
with Fab, Fv or disulfide stabilized Fv antibody domains recombinantly fiised to either the 

20 phage gene III or gene VIII protein. Examples of phage display methods that can be used to 
make the antibodies of the present invention include those disclosed in Brinkman et al., J. 
Immunol. Methods 182:41-50 (1995); Ames et al., J. Immunol. Methods 184:177-186 
(1995); Kettleborough et al., Eur. J. Immunol. 24:952-958 (1994); Persic et al., Gene 187 9- 
18 (1997); Burton et al.. Advances in Immunology 57:191-280 (1994); PCT application No. 

25 PCT/GB91/01I34; PCT publications WO 90/02809; WO 91/10737; WO 92/01047; WO 
92/18619; WO 93/11236; WO 95/15982; WO 95/20401; and U.S. Patent Nos. 5,698,426; 
5,223,409; 5,403,484; 5,580,717; 5,427,908; 5,750,753; 5,821,047; 5,571,698; 5,427,908; 
5,516,637; 5,780,225; 5,658,727; 5,733,743 and 5,969,108; each of which is incorporated 
herein by reference in its entirety. 

30 As described in the above references, after phage selection, the antibody coding 

regions from the phage can be isolated and used to generate whole antibodies, including 
human antibodies, or any other desired antigen binding fragment, and expressed in any 
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desired host, including mammalian cells, insect cells, plant cells, yeast, and bacteria, e.g., as 
described in detail below. For example, techniques to recombinantly produce Fab, Fab' and 
F(ab')2 fragments can also be employed using methods known in the art such as those 
disclosed in PCT publication WO 92/22324; MuUinax et al, BioTechniques 12(6):864-869 
5 (1992); and Sawai et al., AJRI 34:26-34 (1995); and Better et al., Science 240:1041-1043 
(1988) (said references incorporated by reference in their entireties). 

Examples of techniques which can be used to produce single-chain Fvs and antibodies 
include those described in U.S. Patents 4,946,778 and 5,258,498; Huston et al.. Methods in 
Enzymology 203:46-88 (1991); Shu et al., PNAS 90:7995-7999 (1993); and Skerra et al., 

10 Science 240:1038-1040 (1988). For some uses, including in vivo use of antibodies in 
humans and in vitro detection assays, it may be preferable to use chimeric, humanized, or 
human antibodies. A chimeric antibody is a molecule in which different portions of the 
antibody are derived from different animal species, such as antibodies having a variable 
region derived from a murine monoclonal antibody and a human immunoglobulin constant 

15 region. Methods for producing chimeric antibodies are known in the art. See e.g., Morrison, 
Science 229:1202 (1985); Oi et al., BioTechniques 4:214 (1986); Gillies et al., (1989) J. 
Immunol. Methods 125:191-202; U.S. Patent Nos. 5,807,715; 4,816,567; and 4,816397, 
which are incorporated herein by reference in their entirety. Humanized antibodies are 
antibody molecules from non-human species antibody that binds the desired antigen having 

20 one or more complementarity determining regions (CDRs) from the non-human species and 
a framework regions from a human immunoglobulin molecule. Often, framework residues in 
the human framework regions will be substituted with the corresponding residue from the 
CDR donor antibody to alter, preferably improve, antigen binding. These framework 
substitutions are identified by methods well known in the art, e.g., by modeling of the 

25 interactions of the CDR and framework residues to identify framework residues important 
for antigen binding and sequence comparison to identify unusual framework residues at 
particular positions. (See, e.g.. Queen et al., U.S. Patent No. 5,585,089; Riechmann et al., 
Nature 332:323 (1988), which are incorporated herein by reference in their entireties.) 
Antibodies can be humanized using a variety of techniques known in the art including, for 

30 example, CDR-grafting (EP 239,400; PCT publication WO 91/09967; U.S. Patent Nos. 
5,225,539; 5,530,101; and 5,585,089), veneering or resurfacing (EP 592,106; EP 519,596; 
Padlan, Molecular Immunology 28(4/5):489-498 (1991); Studnicka et al.. Protein 
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Engineering 7(6):805-814 (1994); Roguska. et al., PNAS 91:969-973 (1994)), and chain 
shuffling (U.S. Patent No. 5,565,332). 

Completely human antibodies are particularly desirable for therapeutic treatment of 
human patients. Human antibodies can be made by a variety of methods known in the art 
5 including phage display methods described above using antibody libraries derived from 
human immunoglobulin sequences. See also, U.S. Patent Nos. 4,444,887 and 4,716,1 1 1; and 
PCT publications WO 98/46645, WO 98/50433, WO 98/24893, WO 98/16654, WO 
96/34096, WO 96/33735, and WO 91/10741; each of which is incorporated herein by 
reference in its entirety. 

10 Human antibodies can also be produced using transgenic mice which are incapable of 

expressing functional endogenous immunoglobulins, but which can express human 
immunoglobulin genes. For example, the human heavy and light chain immunoglobulin gene 
complexes may be introduced randomly or by homologous recombination into mouse 
embryonic stem cells. Alternatively, the human variable region, constant region, and 

15 diversity region may be introduced into mouse embryonic stem cells in addition to the human 
heavy and light chain genes. The mouse heavy and light chain immunoglobulin genes may 
be rendered non-functional separately or simultaneously with the introduction of human 
immunoglobulin loci by homologous recombination. In particular, homozygous deletion of 
the JH region prevents endogenous antibody production. The modified embryonic stem cells 

20 are expanded and microinjected into blastocysts to produce chimeric mice. The chimeric 
mice are then bred to produce homozygous offspring which express human antibodies. The 
transgenic mice are immunized in the normal fashion with a selected antigen, e.g., all or a 
portion of a polypeptide of the invention. Monoclonal antibodies directed against the 
antigen can be obtained from the immunized, transgenic mice using conventional hybridoma 

25 technology. The human immunoglobulin transgenes harbored by the transgenic mice 
rearrange during B cell differentiation, and subsequently undergo class switching and 
somatic mutation. Thus, using such a technique, it is possible to produce therapeutically 
useful IgG, IgA, IgM and IgE antibodies. For an overview of this technology for producing 
human antibodies, see Lonberg and Huszar, Int. Rev. Immunol. 13:65-93 (1995). For a 

30 detailed discussion of this technology for producing human antibodies and human 
monoclonal antibodies and protocols for producing such antibodies, see, e.g., PCT 
publications WO 98/24893; WO 92/01047; WO 96/34096; WO 96/33735; European Patent 
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No. 0 598 877; U.S. Patent Nos. 5,413,923; 5,625,126; 5,633,425; 5,569,825; 5,661,016; 
5,545,806; 5,814,318; 5,885,793; 5,916,771; and 5,939,598, which are incorporated by 
reference herein in their entirety. In addition, companies such as Abgenix, inc. (Freemont, 
CA) and Genpharm (San Jose, CA) can be engaged to provide human antibodies directed 
5 against a selected antigen using technology similar to that described above. 

Completely human antibodies which recognize a selected epitope can be generated 
using a technique referred to as "guided selection." In this approach a selected non-human 
monoclonal antibody, e.g., a mouse antibody, is used to guide the selection of a completely 
human antibody recognizing the same epitope. (Jespers et aL, Bio/technology 12:899-903 
10 (1988)). 

Further, antibodies to the polypeptides of the invention can, in turn, be utilized to 
generate anti-idiotype antibodies that ''mimic" polypeptides of the invention using techniques 
well known to those skilled in the art. (See, e.g., Greenspan & Bona, FASEB J. 7(5):437-444; 
(1989) and Nissinoff, J. Immunol. 147(8):2429-2438 (1991)). For example, antibodies 

15 which bind to and competitively inhibit polypeptide multimerization and/or binding of a 
polypeptide of the invention to a ligand can be used to generate anti-idiotypes that *'mimic" 
the polypeptide multimerization and/or binding domain and, as a consequence, bind to and 
neutralize polypeptide and/or its ligand. Such neutralizing anti-idiotypes or Fab fragments of 
such anti-idiotypes can be used in therapeutic regimens to neutralize polypeptide ligand. For 

20 example, such anti-idiotypic antibodies can be used to bind a polypeptide of the invention 
and/or to bind its ligands/receptors, and thereby block its biological activity. 

Polynucleotides Encoding Antibodies 

The invention further provides polynucleotides comprising a nucleotide sequence 

25 encoding an antibody of the invention and fragments thereof. The invention also 
encompasses polynucleotides that hybridize under stringent or alternatively, under lower 
stringency hybridization conditions, e.g., as defined supra, to polynucleotides that encode an 
antibody, preferably, that specifically binds to a polypeptide of the invention, preferably, an 
antibody that binds to a polypeptide having the amino acid sequence of SEQ ID NO:Y. 

30 The polynucleotides may be obtained, and the nucleotide sequence of the 

polynucleotides determined, by any method known in the art. For example, if the nucleotide 
sequence of the antibody is known, a polynucleotide encoding the antibody may be 
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assembled from chemically synthesized oligonucleotides (e.g., as described in Kutmeier et 
al., BioTechniques 17:242 (1994)), which, briefly, involves the synthesis of overlapping 
oligonucleotides containing portions of the sequence encoding the antibody, annealing and 
ligating of those oligonucleotides, and then amplification of the ligated oligonucleotides by 
5 PGR. 

Alternatively, a polynucleotide encoding an antibody may be generated from nucleic 
acid from a siiitable source. If a clone containing a nucleic acid encoding a particular 
antibody is not available, but the sequence of the antibody molecule is known, a nucleic acid 
encoding the immunoglobulin may be chemically synthesized or obtained from a suitable 

10 source (e.g., an antibody cDNA library, or a cDNA library generated from, or nucleic acid, 
preferably poly A+ RNA, isolated from, any tissue or cells expressing the antibody, such as 
hybridoma cells selected to express an antibody of the invention) by PGR amplification 
using synthetic primers hybridizable to the 3' and 5' ends of the sequence or by cloning using 
an oligonucleotide probe specific for the particular gene sequence to identify, e.g., a cDNA 

15 clone from a cDNA library that encodes the antibody. Amplified nucleic acids generated by 
PGR may then be cloned into replicable cloning vectors using any method well known in the 
art. 

Once the nucleotide sequence and corresponding amino acid sequence of the antibody 
is determined, the nucleotide sequence of the antibody may be manipulated using methods 

20 well known in the art for the manipulation of nucleotide sequences, e.g., recombinant DNA 
techniques, site directed mutagenesis, PGR, etc. (see, for example, the techniques described 
in Sambrook et al., 1990, Molecular Gloning, A Laboratory Manual, 2d Ed., Gold Spring 
Harbor Laboratory, Gold Spring Harbor, NY and Ausubel et al, eds., 1998, Current Protocols 
in Molecular Biology, John Wiley & Sons, NY, which are both incorporated by reference 

25 herein in their entireties ), to generate antibodies having a different amino acid sequence, for 
example to create amino acid substitutions, deletions, and/or insertions. 

In a specific embodiment, the amino acid sequence of the heavy and/or light chain 
variable domains may be inspected to identify the sequences of the complementarity 
determining regions (GDRs) by methods that are well know in the art, e.g., by comparison to 

30 known amino acid sequences of other heavy and light chain variable regions to determine the 
regions of sequence hypervariability. Using routine recombinant DNA techniques, one or 
more of the GDRs may be inserted within framework regions, e.g., into human framework 
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be naturally occurring or consensus framework regions, and preferably human framework 
regions (see, e.g., Chothia et ah, J. Mol. Biol. 278: 457-479 (1998) for a listing of human 
framework regions). Preferably, the polynucleotide generated by the combination of the 
5 framework regions and CDRs encodes an antibody that specifically binds a polypeptide of 
the invention. Preferably, as discussed supra, one or more amino acid substitutions may be 
made within the framework regions, and, preferably, the amino acid substitutions improve 
binding of the antibody to its antigen. Additionally, such methods may be used to make 
amino acid substitutions or deletions of one or more variable region cysteine residues 

10 participating in an intrachain disulfide bond to generate antibody molecules lacking one or 
more intrachain disulfide bonds. Other alterations to the polynucleotide are encompassed by 
the present invention and within the skill of the art. 

In addition, techniques developed for the production of "chimeric antibodies" 
(Morrison et al., Proc. Natl. Acad. Sci. 81:851-855 (1984); Neuberger et al.. Nature 

15 312:604-608 (1984); Takeda et al.. Nature 314:452-454 (1985)) by splicing genes from a 
mouse antibody molecule of appropriate antigen specificity together with genes from a 
human antibody molecule of appropriate biological activity can be used. As described supra, 
a chimeric antibody is a molecule in which different portions are derived from different 
animal species, such as those having a variable region derived from a murine mAb and a 

20 human immunoglobulin constant region, e.g., humanized antibodies. 

Alternatively, techniques described for the production of single chain antibodies (U.S. 
Patent No. 4,946,778; Bird, Science 242:423- 42 (1988); Huston et al., Proc. Natl. Acad. Sci, 
USA 85:5879-5883 (1988); and Ward et al.. Nature 334:544-54 (1989)) can be adapted to 
produce single chain antibodies. Single chain antibodies are formed by linking the heavy 

25 and light chain fragments of the Fv region via an amino acid bridge, resulting in a single 
chain polypeptide. Techniques for the assembly of functional Fv fragments in E. coli may 
also be used (Skerraet ah, Science 242: 1038- 1041 (1988)). 

Methods of Producing Antibodies 
30 The antibodies of the invention can be produced by any method known in the art for 

the synthesis of antibodies, in particular, by chemical synthesis or preferably, by recombinant 
expression techniques. 
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Recombinant expression of an antibody of the invention, or fragment, derivative or 
analog thereof, (e.g., a heavy or Hght chain of an antibody of the invention or a single chain 
antibody of the invention), requires construction of an expression vector containing a 
polynucleotide that encodes the antibody. Once a polynucleotide encoding an antibody 
5 molecule or a heavy or light chain of an antibody, or portion thereof (preferably containing 
the heavy or light chain variable domain), of the invention has been obtained, the vector for 
the production of the antibody molecule may be produced by recombinant DNA technology 
using techniques well known in the art. Thus, methods for preparing a protein by expressing 
a polynucleotide containing an antibody encoding nucleotide sequence are described herein. 

10 Methods which are well known to those skilled in the art can be used to construct expression 
vectors containing antibody coding sequences and appropriate transcriptional and 
translational control signals. These methods include, for example, in vitro recombinant DNA 
techniques, synthetic techniques, and in vivo genetic recombination. The invention, thus, 
provides replicable vectors comprising a nucleotide sequence encoding an antibody molecule 

15 of the invention, or a heavy or light chain thereof, or a heavy or light chain variable domain, 
operably linked to a promoter. Such vectors may include the nucleotide sequence encoding 
the constant region of the antibody molecule (see, e.g., PCT Publication WO 86/05807; PCT 
Publication WO 89/01036; and U.S. Patent No. 5,122,464) and the variable domain of the 
antibody may be cloned into such a vector for expression of the entire heavy or light chain. 

20 The expression vector is transferred to a host cell by conventional techniques and the 

transfected cells are then cultured by conventional techniques to produce an antibody of the 
invention. Thus, the invention includes host cells containing a polynucleotide encoding an 
antibody of the invention, or a heavy or light chain thereof, or a single chain antibody of the 
invention, operably linked to a heterologous promoter. In preferred embodiments for the 

25 expression of double-chained antibodies, vectors encoding both the heavy and light chains 
may be co-expressed in the host cell for expression of the entire immunoglobulin molecule, 
as detailed below. 

A variety of host-expression vector systems may be utilized to express the antibody 
molecules of the invention. Such host-expression systems represent vehicles by which the 
30 coding sequences of interest may be produced and subsequently purified, but also represent 
cells which may, when transformed or transfected with the appropriate nucleotide coding 
sequences, express an antibody molecule of the invention in situ. These include but are not 
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limited to microorganisms such as bacteria (e.g., E. coli, B, subtilis) transformed with 
recombinant bacteriophage DNA, plasmid DNA or cosmid DNA expression vectors 
containing antibody coding sequences; yeast (e.g., Saccharomyces, Pichia) transformed with 
recombinant yeast expression vectors containing antibody coding sequences; insect cell 
5 systems infected with recombinant virus expression vectors (e.g., baculovirus) containing 
antibody coding sequences; plant cell systems infected with recombinant virus expression 
vectors (e.g., cauliflower mosaic virus, CaMV; tobacco mosaic virus, TMV) or transformed 
with recombinant plasmid expression vectors (e.g., Ti plasmid) containing antibody coding 
sequences; or mammalian cell systems (e.g., COS, CHO, BHK, 293, 3T3 cells) harboring 

10 recombinant expression constructs containing promoters derived from the genome of 
mammalian cells (e.g., metallothionein promoter) or from mammalian viruses (e.g., the 
adenovirus late promoter; the vaccinia virus 7.5K promoter). Preferably, bacterial cells 
such as Escherichia coli, and more preferably, eukaryotic cells, especially for the expression 
of whole recombinant antibody molecule, are used for the expression of a recombinant 

15 antibody molecule. For example, mammalian cells such as Chinese hamster ovary cells 
(CHO), in conjunction with a vector such as the major intermediate early gene promoter 
element from human cytomegalovirus is an effective expression system for antibodies 
(Foecking et al., Gene 45:101 (1986); Cockett et al., Bio/Technology 8:2 (1990)). 

In bacterial systems, a number of expression vectors may be advantageously selected 

20 depending upon the use intended for the antibody molecule being expressed. For example, 
when a large quantity of such a protein is to be produced, for the generation of 
pharmaceutical compositions of an antibody molecule, vectors which direct the expression 
of high levels of fusion protein products that are readily purified may be desirable. Such 
vectors include, but are not limited, to the E. coli expression vector pUR278 (Ruther et al., 

25 EMBO J. 2:1791 (1983)), in which the antibody coding sequence may be ligated 
individually into the vector in frame with the lac Z coding region so that a fusion protein is 
produced; pIN vectors (Inouye & Inouye, Nucleic Acids Res. 13:3101-3109 (1985); Van 
Heeke & Schuster, J. Biol. Chem. 24:5503-5509 (1989)); and the like. pGEX vectors may 
also be used to express foreign polypeptides as fusion proteins with glutathione S-transferase 

30 (GST). In general, such fusion proteins are soluble and can easily be purified from lysed 
cells by adsorption and binding to matrix glutathione-agarose beads followed by elution in 
the presence of free glutathione. The pGEX vectors are designed to include thrombin or 
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factor Xa protease cleavage sites so that the cloned target gene product can be released from 
the GST moiety. 

In an insect system, Autographa califomica nuclear polyhedrosis virus (AcNPV) is 
used as a vector to express foreign genes. The^virus grows in Spodoptera fnigiperda cells. 
5 The antibody coding sequence may be cloned individually into non-essential regions (for 
example the polyhedrin gene) of the virus and placed under control of an AcNPV promoter 
(for example the polyhedrin promoter). 

In mammalian host cells, a number of viral-based expression systems may be utilized, 
in cases where an adenovirus is used as an expression vector, the antibody coding sequence 

10 of interest may be ligated to an adenovirus transcription/translation control complex, e.g., the 
late promoter and tripartite leader sequence. This chimeric gene may then be inserted in the 
adenovirus genome by in vitro or in vivo recombination. Insertion in a non- essential region 
of the viral genome (e.g., region El or E3) will result in a recombinant virus that is viable 
and capable of expressing the antibody molecule in infected hosts, (e.g., see Logan & 

15 Shenk, Proc. Natl. Acad. Sci. USA 81:355-359 (1984)). Specific initiation signals may also 
be required for efficient translation of inserted antibody coding sequences. These signals 
include the ATG initiation codon and adjacent sequences. Furthermore, the initiation codon 
must be in phase with the reading frame of the desired coding sequence to ensure translation 
of the entire insert. These exogenous translational control signals and initiation codons can 

20 be of a variety of origins, both natural and synthetic. The efficiency of expression may be 
enhanced by the inclusion of appropriate transcription enhancer elements, transcription 
terminators, etc. (see Bittner et al.. Methods in Enzymol. 153:51-544 (1987)). 

In addition, a host cell strain may be chosen which modulates the expression of the 
inserted sequences, or modifies and processes the gene product in the specific fashion 

25 desired. Such modifications (e.g., glycosylation) and processing (e.g., cleavage) of protein 
products may be important for the function of the protein. Different host cells have 
characteristic and specific mechanisms for the post-translational processing and modification 
of proteins and gene products. Appropriate cell lines or host systems can be chosen to 
ensure the correct modification and processing of the foreign protein expressed. To this end, 

30 eukaryotic host cells which possess the cellular machinery for proper processing of the 
primary transcript, glycosylation, and phosphorylation of the gene product may be used. 
Such mammalian host cells include but are not limited to CHO, VERY, BHK. Hela, COS, 
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MDCK, 293, 3T3, WI38, and in particular, breast cancer cell lines such as, for example, 
BT483, Hs578T, HTB2, BT20 and T47D, and nonnal mammary gland cell line such as, for 
example, CRL7030 and Hs578Bst. 

For long-term, high-yield production of recombinant proteins, stable expression is 
5 preferred. For example, cell lines which stably express the antibody molecule may be 
engineered. Rather than using expression vectors which contain viral origins of replication, 
host cells can be transformed with DNA controlled by appropriate expression control 
elements (e.g., promoter, enhancer, sequences, transcription terminators, polyadenylation 
sites, etc.), and a selectable marker. Following the introduction of the foreign DNA, 

10 engineered cells may be allowed to grow for 1-2 days in an enriched media, and then are 
switched to a selective media. The selectable marker in the recombinant plasmid confers 
resistance to the selection and allows cells to stably integrate the plasmid into their 
chromosomes and grow to form foci which in turn can be cloned and expanded into cell 
lines. This method may advantageously be used to engineer cell lines which express the 

15 antibody molecule. Such engineered cell lines may be particularly useful in screening and 
evaluation of compounds that interact directly or indirectly with the antibody molecule. 

A number of selection systems may be used, including but not limited to the herpes 
simplex virus thymidine kinase (Wigler et al.. Cell 11:223 (1977)), hypoxanthine-guanine 
phosphoribosyltransferase (Szybalska & Szybalski, Proc. Natl. Acad. Sci. USA 48:202 

20 (1992)), and adenine phosphoribosyltransferase (Lowy et al.. Cell 22:817 (1980)) genes can 
be employed in tk-, hgprt- or aprt- cells, respectively. Also, antimetabolite resistance can be 
used as the basis of selection for the following genes: dhfr, which confers resistance to 
methotrexate (Wigler et al., Natl. Acad. Sci. USA 77:357 (1980); O'Hare et al., Proc. Natl. 
Acad. Sci. USA 78:1527 (1981)); gpt, which confers resistance to mycophenolic acid 

25 (Mulligan & Berg, Proc. Natl. Acad. Sci. USA 78:2072 (1981)); neo, which confers 
resistance to the aminoglycoside G-418 Clinical Pharmacy 12:488-505; Wu and Wu, 
Biotherapy 3:87-95 (1991); Tolstoshev, Ann. Rev. Pharmacol. Toxicol. 32:573-596 (1993); 
Mulligan, Science 260:926-932 (1993); and Morgan and Anderson, Ann. Rev. Biochem. 
62:191-217 (1993); May, 1993, TIB TECH 1 1(5): 155-215); and hygro, which confers 

30 resistance to hygromycin (Santerre et al.. Gene 30:147. (1984)). Methods commonly known 
in the art of recombinant DNA technology may be routinely applied to select the desired 
recombinant clone, and such methods are described, for example, in Ausubel et al. (eds.), 
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Current Protocols in Molecular Biology, John Wiley & Sons, NY (1993); Kriegler, Gene 
Transfer and Expression, A Laboratory Manual, Stockton Press, NY (1990); and in Chapters 
12 and 13, Dracopoli et al. (eds), Current Protocols in Human Genetics, John Wiley & Sons, 
NY (1994); Colberre-Garapin et al., J. Mol. Biol. 150:1 (1981), which are incorporated by 
5 reference herein in their entireties. 

The expression levels of an antibody molecule can be increased by vector 
amplification (for a review, see Bebbington and Hentschel, The use of vectors based on gene 
amplification for the expression of cloned genes in mammalian cells in DNA cloning, Vol.3. 
(Academic Press, New York, 1987)). When a marker in the vector system expressing 

10 antibody is amplifiable, increase in the level of inhibitor present in culture of host cell will 
increase the number of copies of the marker gene. Since the amplified region is associated 
with the antibody gene, production of the antibody will also increase (Crouse et al., Mol. 
Cell. Biol. 3:257 (1983)), 

The host cell may be co-transfected with two expression vectors of the invention, the 

15 first vector encoding a heavy chain derived polypeptide and the second vector encoding a 
light chain derived polypeptide. The two vectors may contain identical selectable markers 
which enable equal expression of heavy and light chain polypeptides. Alternatively, a single 
vector may be used which encodes, and is capable of expressing, both heavy and light chain 
polypeptides. In such situations, the light chain should be placed before the heavy chain to 

20 avoid an excess of toxic free heavy chain (Proudfoot, Nature 322:52 (1986); Kohler, Proc. 
Natl. Acad. Sci. USA 77:2197 (1980)). The coding sequences for the heavy and light chains 
may comprise cDNA or genomic DNA. 

Once an antibody molecule of the invention has been produced by an animal, 
chemically synthesized, or recombinantly expressed, it may be purified by any method 

25 known in the art for purification of an immunoglobulin molecule, for example, by 
chromatography (e.g., ion exchange, affinity, particularly by affinity for the specific antigen 
after Protein A, and sizing column chromatography), centrifugation, differential solubility, or 
by any other standard technique for the purification of proteins. In addition, the antibodies of 
the present invention or fragments thereof can be fused to heterologous polypeptide 

30 sequences described herein or otherwise known in the art, to facilitate purification. 

The present invention encompasses antibodies recombinantly fused or chemically 
conjugated (including both covalently and non-covalently conjugations) to a polypeptide (or 
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portion thereof, preferably at least 10, 20, 30, 40, 50, 60, 70, 80, 90 or 100 amino acids of the 
polypeptide) of the present invention to generate fusion proteins. The fusion does not 
necessarily need to be direct, but may occur through linker sequences. The antibodies may 
be specific for antigens other than polypeptides (or portion thereof, preferably at least 10, 20, 
5 30, 40, 50, 60, 70, 80, 90 or 100 amino acids of the polypeptide) of the present invention. 
For example, antibodies may be used to target the polypeptides of the present invention to 
particular cell types, either in vitro or in vivo, by fusing or conjugating the polypeptides of 
the present invention to antibodies specific for particular cell surface receptors. Antibodies 
fiised or conjugated to the polypeptides of the present invention may also be used in in vitro 

10 immunoassays and purification methods using methods known in the art. See e.g., Harbor et 
al., supra, and PCT publication WO 93/21232; EP 439,095; Naramura et al., Immunol. Lett. 
39:91-99 (1994); U.S. Patent 5,474,981; Gillies et al., PNAS 89:1428-1432 (1992); Fell et 
al., J. Immunol. 146:2446-2452(1991), which are incorporated by reference in their entireties. 
The present invention further includes compositions comprising the polypeptides of 

15 the present invention fused or conjugated to antibody domains other than the variable 
regions. For example, the polypeptides of the present invention may be ftised or conjugated 
to an antibody Fc region, or portion thereof The antibody portion fused to a polypeptide of 
the present invention may comprise the constant region, hinge region, CHI domain, CH2 
domain, and CH3 domain or any combination of whole domains or portions thereof The 

20 polypeptides may also be fused or conjugated to the above antibody portions to form 
multimers. For example, Fc portions fused to the polypeptides of the present invention can 
form dimers through disulfide bonding between the Fc portions. Higher multimeric forms 
can be made by fusing the polypeptides to portions of IgA and IgM. Methods for fusing or 
conjugating the polypeptides of the present invention to antibody portions are known in the 

25 art. See, e.g., U.S. Patent Nos. 5,336,603; 5,622,929; 5,359,046; 5,349,053; 5,447,851; 
5,112,946; EP 307,434; EP 367,166; PCT publications WO 96/04388; WO 91/06570; 
Ashkenazi et al, Proc. Natl, Acad. Sci. USA 88:10535-10539 (1991); Zheng et al., J. 
Immunol. 154:5590-5600 (1995); and Vil et al., Proc. Natl. Acad. Sci. USA 89:1 1337- 
1 1341(1992) (said references incorporated by reference in their entireties). 

30 As discussed, supra, the polypeptides corresponding to a polypeptide, polypeptide 

fragment, or a variant of SEQ ID NO:Y may be fused or conjugated to the above antibody 
portions to increase the in vivo half life of the polypeptides or for use in immunoassays using 
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methods known in the art. Further, the polypeptides corresponding to SEQ ID NO: Y may be 
fused or conjugated to the above antibody portions to facilitate purification. One reported 
example describes chimeric proteins consisting of the first two domains of the human CD4- 
polypeptide and various domains of the constant regions of the heavy or light chains of 
5 mammalian immunoglobulins. (EP 394,827; Traunecker et al,, Nature 331:84-86 (1988). 
The polypeptides of the present invention fused or conjugated to an antibody having 
disulfide- linked dimeric structures (due to the IgG) may also be more efficient in binding 
and neutralizing other molecules, than the monomeric secreted protein or protein fragment 
alone. (Fountoulakis et al., J. Biochem, 270:3958-3964 (1995)). In many cases, the Fc part 

10 in a fusion protein is beneficial in therapy and diagnosis, and thus can result in, for example, 
improved pharmacokinetic properties. (EP A 232,262), Alternatively, deleting the Fc part 
after the fusion protein has been expressed, detected, and purified, would be desired. For 
example, the Fc portion may hinder therapy and diagnosis if the fusion protein is used as an 
antigen for immunizations. In drug discovery, for example, human proteins, such as hIL-5, 

15 have been fused with Fc portions for the purpose of high-throughput screening assays to 
identify antagonists of hIL-5. (See, Bennett et al., J. Molecular Recognition 8:52-58 (1995); 
Johanson et al., J. Biol. Chem. 270:9459-9471 (1995). 

Moreover, the antibodies or fragments thereof of the present invention can be fused to 
marker sequences, such as a peptide to facilitate purification. In preferred embodiments, the 

20 marker amino acid sequence is a hexa-histidine peptide, such as the tag provided in a pQE 
vector (QIAGEN, Inc., 9259 Eton Avenue, Chatsworth, CA, 91311), among others, many of 
which are commercially available. As described in Gentz et al., Proc. Natl. Acad. Sci. USA 
86:821-824 (1989), for instance, hexa-histidine provides for convenient purification of the 
fusion protein. Other peptide tags useful for purification include, but are not limited to, the 

25 "HA" tag, which corresponds to an epitope derived from the influenza hemagglutinin protein 
(Wilson etal.. Cell 37:767 (1984)) and the "flag" tag. 

The present invention further encompasses antibodies or fragments thereof 
conjugated to a diagnostic or therapeutic agent. The antibodies can be used diagnostically 
to, for example, monitor the development or progression of a tumor as part of a clinical 

30 testing procedure to, e.g., determine the efficacy of a given treatment regimen. Detection 
can be facilitated by coupling the antibody to a detectable substance. Examples of detectable 
substances include various enzymes, prosthetic groups, fluorescent materials, luminescent 
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materials, bioluminescent materials, radioactive materials, positron emitting metals using 
various positron emission tomographies, and nonradioactive paramagnetic metal ions. The 
detectable substance may be coupled or conjugated either directly to the antibody (or 
fragment thereof) or indirectly, through an intermediate (such as, for example, a linker known 
in the art) using techniques known in the art. See, for example, U.S. Patent No. 4,741,900 for 
metal ions which can be conjugated to antibodies for use as diagnostics according to the 
present invention. Examples of suitable enzymes include horseradish peroxidase, alkaline 
phosphatase, beta-galactosidase, or acetylcholinesterase; examples of suitable prosthetic 
group complexes include streptavidin/biotin and avidin/biotin; examples of suitable 
fluorescent materials include umbelliferone, fluorescein, fluorescein isothiocyanate, 
rhodamine, dichlorotriazinylamine fluorescein, dansyl chloride or phycoerythrin; an example 
of a luminescent material includes luminol; examples of bioluminescent materials include 
luciferase, luciferin, and aequorin; and examples of suitable radioactive material include 
1251, 1311, lllln or99Tc. 

Further, an antibody or fragment thereof may be conjugated to a therapeutic moiety 
such as a cytotoxin, e.g., a cytostatic or cytocidal agent, a therapeutic agent or a radioactive 
metal ion, e.g., alpha-emitters such as, for example, 213Bi. A cytotoxin or cytotoxic agent 
includes any agent that is detrimental to cells. Examples include paclitaxol, cytochalasin B, 
gramicidin D, ethidium bromide, emetine, mitomycin, etoposide, tenoposide, vincristine, 
vinblastine, coichicin, doxorubicin, daunorubicin, dihydroxy anthracin dione, mitoxantrone, 
mithramycin, actinomycin D, 1-dehydrotestosterone, glucocorticoids, procaine, tetracaine, 
lidocaine, propranolol, and puromycin and analogs or homologs thereof. Therapeutic agents 
include, but are not limited to, antimetabolites (e.g., methotrexate, 6-mercaptopurine, 6- 
thioguanine, cytarabine, 5-fluorouracil decarbazine), alkylating agents (e.g., 
mechlorethamine, thioepa chlorambucil, melphalan, carmustine (BSNU) and lomustine 
(CCNU), cyclothosphamide, busulfan, dibromomannitol, streptozotocin, mitomycin C, and 
cis- dichlorodiamine platinum (II) (DDF) cisplatin), anthracyclines (e.g., daunorubicin 
(formerly daunomycin) and doxorubicin), antibiotics (e.g., dactinomycin (formerly 
actinomycin), bleomycin, mithramycin, and anthramycin (AMC)), and anti-mitotic agents 
(e.g., vincristine and vinblastine). 

The conjugates of the invention can be used for modifying a given biological 
response, the therapeutic agent or drug moiety is not to be construed as limited to classical 
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chemical therapeutic agents. For example, the drug moiety may be a protein or polypeptide 
possessing a desired biological activity. Such proteins may include, for example, a toxin 
such as abrin, ricin A, pseudomonas exotoxin, or diphtheria toxin; a protein such as tumor 
necrosis factor, a-interferon, 6-interferon, nerve growth factor, platelet derived growth factor, 
5 tissue plasminogen activator, an apoptotic agent, e.g., TNF-alpha, TNF-beta, AIM I (See, 
International Publication No. WO 97/33899), AIM II (See, International Publication No. WO 
97/34911), Fas Ligand (Takahashi et aL Int. Immunol., 6:1567-1574 (1994)), VEGI (See, 
International Publication No. WO 99/23105), a thrombotic agent or an anti- angiogenic 
agent, e.g., angiostatin or endostatin; or, biological response modifiers such as, for example, 

10 lymphokines, interleukin-1 ("IL-1"), interleukin-2 ("lL-2"), interleukin-6 ("IL-6"), 
granulocyte macrophage colony stimulating factor ("GM-CSF"), granulocyte colony 
stimulating factor ("G-CSF"), or other growth factors. 

Antibodies may also be attached to solid supports, which are particularly useful for 
immunoassays or purification of the target antigen. Such solid supports include, but are not 

15 limited to, glass, cellulose, polyacrylamide, nylon, polystyrene, polyvinyl chloride or 
polypropylene. 

Techniques for conjugating such therapeutic moiety to antibodies are well known, 
see, e.g., Amon et al., "Monoclonal Antibodies For Immunotargeting Of Drugs In Cancer 
Therapy", in Monoclonal Antibodies And Cancer Therapy, Reisfeld et al. (eds.), pp. 243-56 

20 (Alan R. Liss, Inc. 1985); Hellstrom et al., "Antibodies For Drug Delivery", in Controlled 
Drug Delivery (2nd Ed.), Robinson et al. (eds.), pp. 623-53 (Marcel Dekker, Inc. 1987); 
Thorpe, "Antibody Carriers Of Cytotoxic Agents In Cancer Therapy: A Review", in 
Monoclonal Antibodies '84: Biological And Clinical Applications, Pinchera et al. (eds.), pp. 
475-506 (1985); "Analysis, Results, And Future Prospective Of The Therapeutic Use Of 

25 Radiolabeled Antibody In Cancer Therapy", in Monoclonal Antibodies For Cancer 
Detection And Therapy, Baldwin et al. (eds.), pp. 303-16 (Academic Press 1985), and Thorpe 
et al., "The Preparation And Cytotoxic Properties Of Antibody-Toxin Conjugates", 
Immunol. Rev. 62:1 19-58 (1982). 

Alternatively, an antibody can be conjugated to a second antibody to form an 

30 antibody heteroconjugate as described by Segal in U.S. Patent No. 4,676,980, which is 
incorporated herein by reference in its entirety. 
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An antibody, with or without a therapeutic moiety conjugated to it, administered 
alone or in combination with cytotoxic factor(s) and/or cytokine(s) can be used as a 
therapeutic. 

5 Immunophenotyping 

The antibodies of the invention may be utilized for immunophenotyping of cell lines 
and biological samples. The translation product of the gene of the present invention may be 
useful as a cell specific marker, or more specifically as a cellular marker that is differentially 
expressed at various stages of differentiation and/or maturation of particular cell types. 
10 Monoclonal antibodies directed against a specific epitope, or combination of epitopes, will 
allow for the screening of cellular populations expressing the marker. Various techniques can 
be utilized using monoclonal antibodies to screen for cellular populations expressing the 
marker(s), and include magnetic separation using antibody-coated magnetic beads, "panning" 
with antibody attached to a solid matrix (i.e., plate), and flow cytometry (See, e.g., U.S. 
15 Patent 5,985,660; and Morrison et aL Cell. 96:111 A9 (1999)). 

These techniques allow for the screening of particular populations of cells, such as 
might be found with hematological malignancies (i.e. minimal residual disease (MRD) in 
acute leukemic patients) and "non-self cells in transplantations to prevent Graft-versus-Host 
Disease (GVHD). Alternatively, these techniques allow for the screening of hematopoietic 
20 stem and progenitor cells capable of undergoing proliferation and/or differentiation, as might 
be found in human umbilical cord blood. 

Assays For Antibody Binding 

The antibodies of the invention may be assayed for immunospecific binding by any 

25 method known in the art. The immunoassays which can be used include but are not limited 
to competitive and non-competitive assay systems using techniques such as western blots, 
radioimmunoassays, ELISA (enzyme linked immunosorbent assay), "sandwich" 
immunoassays, immunoprecipitation assays, precipitin reactions, gel diffusion precipitin 
reactions, immunodiffusion assays, agglutination assays, complement-fixation assays, 

30 immunoradiometric assays, fluorescent immunoassays, protein A immunoassays, to name 
but a few. Such assays are routine and well known in the art (see, e.g., Ausubel et al, eds, 
1994, Current Protocols in Molecular Biology, Vol. 1, John Wiley & Sons, Inc., New York, 
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which is incorporated by reference herein in its entirety). Exemplary immunoassays are 
described briefly below (but are not intended by way of limitation). 

Immunoprecipitation protocols generally comprise lysing a population of cells in a 
lysis buffer such as RlPA buffer (1% NP-40 or Triton X- 100, 1% sodium deoxycholate, 
5 0.1% SDS, 0.15 M NaCl, 0.01 M sodium phosphate at pH 7.2, 1% Trasylol) supplemented 
with protein phosphatase and/or protease inhibitors (e.g., EDTA, PMSF, aprotinin, sodium 
vanadate), adding the antibody of interest to the cell lysate, incubating for a period of time 
(e.g., 1-4 hours) at 4° C, adding protein A and/or protein G sepharose beads to the cell lysate, 
incubating for about an hour or more at 4° C, washing the beads in lysis buffer and 

10 resuspending the beads in SDS/sample buffer. The ability of the antibody of interest to 
immunoprecipitate a particular antigen can be assessed by, e.g., western blot analysis. One 
of skill in the art would be knowledgeable as to the parameters that can be modified to 
increase the binding of the antibody to an antigen and decrease the background (e.g., pre- 
clearing the cell lysate with sepharose beads). For further discussion regarding 

15 immunoprecipitation protocols see, e.g., Ausubel et al, eds, 1994, Current Protocols in 
Molecular Biology, Vol. 1, John Wiley & Sons, Inc., New York at 10.16.1. 

Western blot analysis generally comprises preparing protein samples, electrophoresis 
of the protein samples in a polyacrylamide gel (e.g., 8%- 20% SDS-PAGE depending on the 
molecular weight of the antigen), transferring the protein sample from the polyacrylamide gel 

20 to a membrane such as nitrocellulose, PVDF or nylon, blocking the membrane in blocking 
solution (e.g., PBS with 3% BSA or non-fat milk), washing the membrane in washing buffer 
(e.g., PBS-Tween 20), blocking the membrane with primary antibody (the antibody of 
interest) diluted in blocking buffer, washing the membrane in washing buffer, blocking the 
membrane with a secondary antibody (which recognizes the primary antibody, e.g., an anti- 

25 human antibody) conjugated to an enzymatic substrate (e.g., horseradish peroxidase or 
alkaline phosphatase) or radioactive molecule (e.g., 32P or 1251) diluted in blocking buffer, 
washing the membrane in wash buffer, and detecting the presence of the antigen. One of 
skill in the art would be knowledgeable as to the parameters that can be modified to increase 
the signal detected and to reduce the background noise. For further discussion regarding 

30 western blot protocols see, e.g., Ausubel et al, eds, 1994, Current Protocols in Molecular 
Biology, Vol. 1, John Wiley & Sons, Inc., New York at 10.8.1. 



wo 00/55173 



PCT/USOO/05881 



216 

ELISAs comprise preparing antigen, coating the well of a 96 well microtiter plate 
with the antigen, adding the antibody of interest conjugated to a detectable compound such 
as an enzymatic substrate (e.g., horseradish peroxidase or alkaline phosphatase) to the well 
and incubating for a period of time, and detecting the presence of the antigen. In ELISAs the 
5 antibody of interest does not have to be conjugated to a detectable compound; instead, a 
second antibody (which recognizes the antibody of interest) conjugated to a detectable 
compound may be added to the well. Further, instead of coating the well with the antigen, 
the antibody may be coated to the well. In this case, a second antibody conjugated to a 
detectable compound may be added following the addition of the antigen of interest to the 

10 coated well. One of skill in the art would be knowledgeable as to the parameters that can be 
modified to increase the signal detected as well as other variations of ELISAs known in the 
art. For further discussion regarding ELISAs see, e.g., Ausubel et al, eds, 1994, Current 
Protocols in Molecular Biology, Vol. I, John Wiley & Sons, Inc., New York at 1 1.2.1. 

The binding affinity of an antibody to an antigen and the off-rate of an antibody- 

15 antigen interaction can be determined by competitive binding assays. One example of a 
competitive binding assay is a radioimmunoassay comprising the incubation of labeled 
antigen (e.g., 3H or 1251) with the antibody of interest in the presence of increasing amounts 
of unlabeled antigen, and the detection of the antibody bound to the labeled antigen. The 
affinity of the antibody of interest for a particular antigen and the binding off-rates can be 

20 determined from the data by scatchard plot analysis. Competition with a second antibody 
can also be determined using radioimmunoassays. In this case, the antigen is incubated with 
antibody of interest conjugated to a labeled compound (e.g., 3H or 1251) in the presence of 
increasing amounts of an unlabeled second antibody. 

25 Therapeutic Uses 

The present invention is further directed to antibody-based therapies which involve 
administering antibodies of the invention to an animal, preferably a mammal, and most 
preferably a human, patient for treating one or more of the disclosed diseases, disorders, or 
conditions. Therapeutic compounds of the invention include, but are not limited to, 

30 antibodies of the invention (including fragments, analogs and derivatives thereof as described 
herein) and nucleic acids encoding antibodies of the invention (including fragments, analogs 
and derivatives thereof and anti-idiotypic antibodies as described herein). The antibodies of 
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the invention can be used to treat, inhibit or prevent diseases, disorders or conditions 
associated with aberrant expression and/or activity of a polypeptide of the invention, 
including, but not limited to, any one or more of the diseases, disorders, or conditions 
described herein. The treatment and/or prevention of diseases, disorders, or conditions 
5 associated with aberrant expression and/or activity of a polypeptide of the invention 
includes, but is not limited to, alleviating symptoms associated with those diseases, disorders 
or conditions. Antibodies of the invention may be provided in pharmaceutically acceptable 
compositions as known in the art or as described herein. 

A summary of the ways in which the antibodies of the present invention may be used 

10 therapeutically includes binding polynucleotides or polypeptides of the present invention 
locally or systemically in the body or by direct cytotoxicity of the antibody, e.g. as mediated 
by complement (CDC) or by effector cells (ADCC). Some of these approaches are 
described in more detail below. Armed with the teachings provided herein, one of ordinary 
skill in the art will know how to use the antibodies of the present invention for diagnostic, 

15 monitoring or therapeutic purposes without undue experimentation. 

The antibodies of this invention may be advantageously utilized in combination with 
other monoclonal or chimeric antibodies, or with lymphokines or hematopoietic growth 
factors (such as, e.g., lL-2, IL-3 and lL-7), for example, which serve to increase the number 
or activity of effector cells which interact with the antibodies. 

20 The antibodies of the invention may be administered alone or in combination with 

other types of treatments (e.g., radiation therapy, chemotherapy, hormonal therapy, 
immunotherapy and anti-tumor agents). Generally, administration of products of a species 
origin or species reactivity (in the case of antibodies) that is the same species as that of the 
patient is preferred. Thus, in a preferred embodiment, human antibodies, fragments 

25 derivatives, analogs, or nucleic acids, are administered to a human patient for therapy or 
prophylaxis. 

It is preferred to use high affinity and/or potent in vivo inhibiting and/or neutralizing 
antibodies against polypeptides or polynucleotides of the present invention, fragments or 
regions thereof, for both immunoassays directed to and therapy of disorders related to 
30 polynucleotides or polypeptides, including fragments thereof, of the present invention. Such 
antibodies, fragments, or regions, will preferably have an affinity for polynucleotides or 
polypeptides of the invention, including fragments thereof. Preferred binding affinities 
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include those with a dissociation constant or Kd less than 5 X 10'" M, 10'' M, 5 X 10'^ M, 
10'^ M, 5 X 10'^ M, 10-' M, 5 X 10'^ M, 10'^ M, 5 X 10"^ M, lO'^ M, 5 X 10"^ M, 10"' M, 5 X 
10*^ M, 10-^M, 5X 10-^ M, 10'^ M,5X 10*'"M, 10-'**M,5X lO'*' M, 10''* M, 5 X lO'^^M, 
lO-'-M, 5X 10"'^ M, 10"'^M,5X lO'^^M, 10-''M,5X IC'^M, and 10"^^ M. 

5 

Gene Therapy 

in a specific embodiment, nucleic acids comprising sequences encoding antibodies or 
functional derivatives thereof, are administered to treat, inhibit or prevent a disease or 
disorder associated with aberrant expression and/or activity of a polypeptide of the invention, 
10 by way of gene therapy. Gene therapy refers to therapy performed by the administration to a 
subject of an expressed or expressible nucleic acid. In this embodiment of the invention, the 
nucleic acids produce their encoded protein that mediates a therapeutic effect. 

Any of the methods for gene therapy available in the art can be used according to the 
present invention. Exemplary methods are described below. 

15 For general reviews of the methods of gene therapy, see Goldspiel et al.. Clinical 

Pharmacy 12:488-505 (1993); Wu and Wu, Biotherapy 3:87-95 (1991); Tolstoshev, Ann. 
Rev. Pharmacol, Toxicol. 32:573-596 (1993); Mulligan, Science 260:926-932 (1993); and 
Morgan and Anderson, Ann. Rev. Biochem. 62:191-217 (1993); May, TIBTECH 1 1(5):155- 
215 (1993), Methods commonly known in the art of recombinant DNA technology which can 

20 be used are described in Ausubel et al. (eds.), Current Protocols in Molecular Biology, John 
Wiley & Sons, NY (1993); and Kriegler, Gene Transfer and Expression, A Laboratory 
Manual, Stockton Press, NY (1990). 

In a preferred aspect, the compound comprises nucleic acid sequences encoding an 
antibody, said nucleic acid sequences being part of expression vectors that express the 

25 antibody or fragments or chimeric proteins or heavy or light chains thereof in a suitable host. 
In particular, such nucleic acid sequences have promoters operably linked to the antibody 
coding region, said promoter being inducible or constitutive, and, optionally, tissue-specific. 
In another particular embodiment, nucleic acid molecules are used in which the antibody 
coding sequences and any other desired sequences are flanked by regions that promote 

30 homologous recombination at a desired site in the genome, thus providing for 
intrachromosomal expression of the antibody encoding nucleic acids (KoUer and Smithies, 
Proc. Natl. Acad. Sci. USA 86:8932-8935 (1989); Zijlstra et al.. Nature 342:435-438 (1989). 
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In specific embodiments, the expressed antibody molecule is a single chain antibody; 
alternatively, the nucleic acid sequences include sequences encoding both the heavy and 
light chains, or fragments thereof, of the antibody. 

Delivery of the nucleic acids into a patient may be either direct, in which case the 
5 patient is directly exposed to the nucleic acid or nucleic acid- carrying vectors, or indirect, in 
which case, cells are first transformed with the nucleic acids in vitro, then transplanted into 
the patient. These two approaches are known, respectively, as in vivo or ex vivo gene 
therapy. 

In a specific embodiment, the nucleic acid sequences are directly administered in 

10 vivo, where it is expressed to produce the encoded product. This can be accomplished by 
any of numerous methods known in the art, e.g., by constructing them as part of an 
appropriate nucleic acid expression vector and administering it so that they become 
intracellular, e.g., by infection using defective or attenuated retrovirals or other viral vectors 
(see U.S. Patent No. 4,980,286), or by direct injection of naked DNA, or by use of 

15 microparticle bombardment (e.g., a gene gun; BioHstic, Dupont), or coating with lipids or 
cell-surface receptors or transfecting agents, encapsulation in liposomes, microparticles, or 
microcapsules, or by administering them in linkage to a peptide which is known to enter the 
nucleus, by administering it in linkage to a ligand subject to receptor-mediated endocytosis 
(see, e.g., Wu and Wu, J. Biol. Chem. 262:4429-4432 (1987)) (which can be used to target 

20 cell types specifically expressing the receptors), etc. In another embodiment, nucleic acid- 
ligand complexes can be formed in which the ligand comprises a fusogenic viral peptide to 
disrupt endosomes, allowing the nucleic acid to avoid lysosomal degradation. In yet another 
embodiment, the nucleic acid can be targeted in vivo for cell specific uptake and expression, 
by targeting a specific receptor (see, e.g., PCT Publications WO 92/06180; WO 92/22635; 

25 WO92/203I6; W093/14188, WO 93/20221). Alternatively, the nucleic acid can be 
introduced intracellularly and incorporated within host cell DNA for expression, by 
homologous recombination (Koller and Smithies, Proc. Natl. Acad. Sci. USA 86:8932-8935 
(1989); Zijlstra et al.. Nature 342:435-438 (1989)). 

In a specific embodiment, viral vectors that contains nucleic acid sequences encoding 

30 an antibody of the invention are used. For example, a retroviral vector can be used (see 
Miller et al., Meth. Enzymol. 217:581-599 (1993)). These retroviral vectors contain the 
components necessary for the correct packaging of the viral genome and integration into the 
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host cell DNA. The nucleic acid sequences encoding the antibody to be used in gene therapy 
are cloned into one or more vectors, which facilitates delivery of the gene into a patient. 
More detail about retroviral vectors can be found in Boesen et al., Biotherapy 6:291-302 
(1994), which describes the use of a retroviral vector to deliver the mdrl gene to 
5 hematopoietic stem cells in order to make the stem cells more resistant to chemotherapy. 
Other references illustrating the use of retroviral vectors in gene therapy are: Clowes et al., J. 
Clin. Invest. 93:644-651 (1994); Kiem et al., Blood 83:1467-1473 (1994); Salmons and 
Gunzberg, Human Gene Therapy 4:129-141 (1993); and Grossman and Wilson, Curr. Opin. 
in Genetics and Devel. 3:110-114 (1993). 

10 Adenoviruses are other viral vectors that can be used in gene therapy. Adenoviruses 

are especially attractive vehicles for delivering genes to respiratory epithelia. Adenoviruses 
naturally infect respiratory epithelia where they cause a mild disease. Other targets for 
adenovirus-based delivery systems are liver, the central nervous system, endothelial cells, 
arid muscle. Adenoviruses have the advantage of being capable of infecting non-dividing 

15 cells. Kozarsky and Wilson, Current Opinion in Genetics and Development 3:499-503 
(1993) present a review of adenovirus-based gene therapy. Bout et al., Human Gene 
Therapy 5:3-10 (1994) demonstrated the use of adenovirus vectors to transfer genes to the 
respiratory epithelia of rhesus monkeys. Other instances of the use of adenoviruses in gene 
therapy can be found in Rosenfeld et al.. Science 252:431-434 (1991); Rosenfeld et ai., Cell 

20 68:143- 155 (1992); Mastrangeli et al,, J. Clin. Invest. 91:225-234 (1993); PCT Publication 
W094/12649; and Wang, et al.. Gene Therapy 2:775-783 (1995). In a preferred 
embodiment, adenovirus vectors are used. 

Adeno-associated virus (AAV) has also been proposed for use in gene therapy (Walsh 
et al., Proc. Soc. Exp. Biol. Med. 204:289-300 (1993); U.S. Patent No. 5,436,146). 

25 Another approach to gene therapy involves transferring a gene to cells in tissue 

culture by such methods as electroporation, lipofection, calcium phosphate mediated 
transfection, or viral infection. Usually, the method of transfer includes the transfer of a 
selectable marker to the cells. The cells are then placed under selection to isolate those cells 
that have taken up and are expressing the transferred gene. Those cells are then delivered to 

30 a patient. 

In this embodiment, the nucleic acid is introduced into a cell prior to administration in 
vivo of the resulting recombinant cell. Such introduction can be carried out by any method 
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known in the art, including but not limited to transfection, electroporation, microinjection, 
infection with a viral or bacteriophage vector containing the nucleic acid sequences, cell 
fusion, chromosome-mediated gene transfer, microcell-mediated gene transfer, spheroplast 
fusion, etc. Numerous techniques are known in the art for the introduction of foreign genes 
5 into cells (see, e.g., Loeffler and Behr, Meth. Enzymol. 217:599-618 (1993); Cohen et aL, 
Meth. Enzymol. 217:618-644 (1993); Cline, Pharmac. Ther. 29:69-92m (1985) and may be 
used in accordance with the present invention, provided that the necessary developmental 
and physiological functions of the recipient cells are not disrupted. The technique should 
provide for the stable transfer of the nucleic acid to the cell, so that the nucleic acid is 

10 expressible by the cell and preferably heritable and expressible by its cell progeny. 

The resulting recombinant cells can be delivered to a patient by various methods 
known in the art. Recombinant blood cells (e.g., hematopoietic stem or progenitor cells) are 
preferably administered intravenously. The amount of cells envisioned for use depends on 
the desired effect, patient state, etc., and can be determined by one skilled in the art. 

15 Cells into which a nucleic acid can be introduced for puqjoses of gene therapy 

encompass any desired, available cell type, and include but are not limited to epithelial cells, 
endothelial cells, keratinocytes, fibroblasts, muscle cells, hepatocytes; blood cells such as 
Tlymphocytes, Blymphocytes, monocytes, macrophages, neutrophils, eosinophils, 
megakaryocytes, granulocytes; various stem or progenitor cells, in particular hematopoietic 

20 stem or progenitor cells, e.g., as obtained from bone marrow, umbilical cord blood, 
peripheral blood, fetal liver, etc. 

In a preferred embodiment, the cell used for gene therapy is autologous to the patient. 
In an embodiment in which recombinant cells are used in gene therapy, nucleic acid 
sequences encoding an antibody are introduced into the cells such that they are expressible 

25 by the cells or their progeny, and the recombinant cells are then administered in vivo for 
therapeutic effect. In a specific embodiment, stem or progenitor cells are used. Any stem 
and/or progenitor cells which can be isolated and maintained in vitro can potentially be used 
in accordance with this embodiment of the present invention (see e.g. PCT PubUcation WO 
94/08598; Stemple and Anderson, Cell 71:973-985 (1992); Rheinwald, Meth. Cell Bio. 

30 21A:229 (1980); and Pittelkow and Scott, Mayo Clinic Proc. 61:771 (1986)). 

In a specific embodiment, the nucleic acid to be introduced for purposes of gene 
therapy comprises an inducible promoter operably linked to the coding region, such that 
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expression of the nucleic acid is controllable by controlling the presence or absence of the 
appropriate inducer of transcription. Demonstration of Therapeutic or Prophylactic Activity 
The compounds or pharmaceutical compositions of the invention are preferably tested 
in vitro, and then in vivo for the desired therapeutic or prophylactic activity, prior to use in 
5 humans. For example, in vitro assays to demonstrate the therapeutic or prophylactic utility of 
a compound or pharmaceutical composition include, the effect of a compound on a cell line 
or a patient tissue sample. The effect of the compound or composition on the cell line and/or 
tissue sample can be determined utilizing techniques known to those of skill in the art 
including, but not limited to, rosette formation assays and cell lysis assays. In accordance 
10 with the invention, in vitro assays which can be used to determine whether administration of 
a specific compound is indicated, include in vitro cell culture assays in which a patient tissue 
sample is grown in culture, and exposed to or otherwise administered a compound, and the 
effect of such compound upon the tissue sample is observed. 

1 5 Therapeutic/Prophylactic Administration and Composition 

The invention provides methods of treatment, inhibition and prophylaxis by 
administration to a subject of an effective amount of a compound or pharmaceutical 
composition of the invention, preferably a polypeptide or antibody of the invention. In a 
preferred aspect, the compound is substantially purified (e.g., substantially free from 

20 substances that limit its effect or produce undesired side-effects). The subject is preferably 
an animal, including but not limited to animals such as cows, pigs, horses, chickens, cats, 
dogs, etc., and is preferably a mammal, and most preferably human. 

Formulations and methods of administration that can be employed when the 
compound comprises a nucleic acid or an immunoglobulin are described above; additional 

25 appropriate formulations and routes of administration can be selected from among those 
described herein below. 

Various delivery systems are known and can be used to administer a compound of the 
invention, e.g., encapsulation in liposomes, microparticles, microcapsules, recombinant cells 
capable of expressing the compound, receptor-mediated endocytosis (see, e.g., Wu and Wu, 

30 J. Biol. Chem. 262:4429-4432 (1987)), construction of a nucleic acid as part of a retroviral 
or other vector, etc. Methods of introduction include but are not limited to intradermal, 
intramuscular, intraperitoneal, intravenous, subcutaneous, intranasal, epidural, and oral 
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routes. The compounds or compositions may be administered by any convenient route, for 
example by infusion or bolus injection, by absorption through epithelial or mucocutaneous 
linings (e.g,, oral mucosa, rectal and intestinal mucosa, etc.) and may be administered 
together with other biologically active agents. Administration can be systemic or local. In 
5 addition, it may be desirable to introduce the pharmaceutical compounds or compositions of 
the invention into the central nervous system by any suitable route, including intraventricular 
and intrathecal injection; intraventricular injection may be facilitated by an intraventricular 
catheter, for example, attached to a reservoir, such as an Ommaya reservoir. Pulmonary 
administration can also be employed, e.g., by use of an inhaler or nebulizer, and formulation 
10 with an aerosolizing agent. 

In a specific embodiment, it may be desirable to administer the pharmaceutical 
compounds or compositions of the invention locally to the area in need of treatment; this may 
be achieved by, for example, and not by way of limitation, local infusion during surgery, 
topical application, e.g., in conjunction with a wound dressing after surgery, by injection, by 
15 means of a catheter, by means of a suppository, or by means of an implant, said implant 
being of a porous, non-porous, or gelatinous material, including membranes, such as sialastic 
membranes, or fibers. Preferably, when administering a protein, including an antibody, of 
the invention, care must be taken to use materials to which the protein does not absorb. 

In another embodiment, the compound or composition can be delivered in a vesicle, 
20 in particular a liposome (see Langer, Science 249:1527-1533 (1990); Treat et al., in 
Liposomes in the Therapy of Infectious Disease and Cancer, Lopez-Berestein and Fidler 
(eds.), Liss, New York, pp. 353- 365 (1989); Lopez-Berestein, ibid., pp. 317-327; see 
generally ibid.) 

In yet another embodiment, the compound or composition can be delivered in a 
25 controlled release system. In one embodiment, a pump may be used (see Langer, supra; 
Sefton, CRC Crit. Ref Biomed. Eng. 14:201 (1987); Buchwald et al., Surgery 88:507 (1980); 
Saudek et al., N. Engl. J. Med. 321:574 (1989)). In another embodiment, polymeric 
materials can be used (see Medical Applications of Controlled Release, Langer and Wise 
(eds.), CRC Pres., Boca Raton, Florida (1974); Controlled Drug Bioavailability, Drug 
30 Product Design and Performance, Smolen and Ball (eds.), Wiley, New York (1984); Ranger 
and Peppas, J., Macromol. Sci. Rev. Macromol. Chem. 23:61 (1983); see also Levy et al.. 
Science 228:190 (1985); During et al., Ann. Neurol. 25:351 (1989); Howard et al.. 
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J.Neurosurg. 71 :105 (1989)). In yet another embodiment, a controlled release system can be 
placed in proximity of the therapeutic target, i.e., the brain, thus requiring only a fraction of 
the systemic dose (see, e.g., Goodson, in Medical Applications of Controlled Release, supra, 
vol. 2, pp. 115-138 (1984)). 
5 Other controlled release systems are discussed in the review by Langer (Science 

249:1527-1533 (1990)). 

In a specific embodiment where the compound of the invention is a nucleic acid 
encoding a protein, the nucleic acid can be administered in vivo to promote expression of its 
encoded protein, by constructing it as part of an appropriate nucleic acid expression vector 

10 and administering it so that it becomes intracellular, e.g., by use of a retroviral vector (see 
U.S. Patent No. 4,980,286), or by direct injection, or by use of microparticle bombardment 
(e.g., a gene gun; Biolistic, Dupont), or coating with lipids or cell-surface receptors or 
transfecting agents, or by administering it in linkage to a homeobox- like peptide which is 
known to enter the nucleus (see e.g., JoUot et al., Proc. Natl. Acad. Sci. USA 88:1864-1868 

15 (1991)), etc. Alternatively, a nucleic acid can be introduced intracellularly and incorporated 
within host cell DNA for expression, by homologous recombination. 

The present invention also provides pharmaceutical compositions. Such compositions 
comprise a therapeutically effective amount of a compound, and a pharmaceutically 
acceptable carrier. In a specific embodiment, the term "pharmaceutically acceptable" means 

20 approved by a regulatory agency of the Federal or a state government or listed in the U.S. 
Pharmacopeia or other generally recognized pharmacopeia for use in animals, and more 
particularly in humans. The term "carrier" refers to a diluent, adjuvant, excipient, or vehicle 
with which the therapeutic is administered. Such pharmaceutical carriers can be sterile 
liquids, such as water and oils, including those of petroleum, animal, vegetable or synthetic 

25 origin, such as peanut oil, soybean oil, mineral oil, sesame oil and the like. Water is a 
preferred carrier when the pharmaceutical composition is administered intravenously. Saline 
solutions and aqueous dextrose and glycerol solutions can also be employed as liquid 
carriers, particularly for injectable solutions. Suitable pharmaceutical excipients include 
starch, glucose, lactose, sucrose, gelatin, malt, rice, flour, chalk, silica gel, sodium stearate, 

30 glycerol monostearate, talc, sodium chloride, dried skim milk, glycerol, propylene, glycol, 
water, ethanol and the like. The composition, if desired, can also contain minor amounts of 
wetting or emulsifying agents, or pH buffering agents. These compositions can take the form 
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of solutions, suspensions, emulsion, tablets, pills, capsules, powders, sustained-release 
formulations and the like. The composition can be formulated as a suppository, with 
traditional binders and carriers such as triglycerides. Oral formulation can include standard 
carriers such as pharmaceutical grades of mannitol, lactose, starch, magnesium stearate, 
5 sodium saccharine, cellulose, magnesium carbonate, etc. Examples of suitable 
pharmaceutical carriers are described in "Remington's Pharmaceutical Sciences" by E.W. 
Martin. Such compositions will contain a therapeutically effective amount of the compound, 
preferably in purified form, together with a suitable amount of carrier so as to provide the 
form for proper administration to the patient. The formulation should suit the mode of 
, 10 administration. 

In a preferred embodiment, the composition is formulated in accordance with routine 
procedures as a pharmaceutical composition adapted for intravenous administration to 
human beings. Typically, compositions for intravenous administration are solutions in sterile 
isotonic aqueous buffer. Where necessary, the composition may also include a solubilizing 

15 agent and a local anesthetic such as lignocaine to ease pain at the site of the injection. 
Generally, the ingredients are supplied either separately or mixed together in unit dosage 
form, for example, as a dry lyophilized powder or water free concentrate in a hermetically 
sealed container such as an ampoule or sachette indicating the quantity of active agent. 
Where the composition is to be administered by infusion, it can be dispensed with an 

20 infusion bottle containing sterile pharmaceutical grade water or saline. Where the 
composition is administered by injection, an ampoule of sterile water for injection or saline 
can be provided so that the ingredients may be mixed prior to administration. 

The compounds of the invention can be formulated as neutral or salt forms. 
Pharmaceutically acceptable salts include those formed with anions such as those derived 

25 from hydrochloric, phosphoric, acetic, oxalic, tartaric acids, etc., and those formed with 
cations such as those derived from sodium, potassium, ammonium, calcium, ferric 
hydroxides, isopropylamine, triethylamine, 2-ethylamino ethanol, histidine, procaine, etc. 

The amount of the compound of the invention which will be effective in the 
treatment, inhibition and prevention of a disease or disorder associated with aberrant 

30 expression and/or activity of a polypeptide of the invention can be determined by standard 
clinical techniques. In addition, in vitro assays may optionally be employed to help identify 
optimal dosage ranges. The precise dose to be employed in the formulation will also depend 
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on the route of administration, and the seriousness of the disease or disorder, and should be 
decided according to the judgment of the practitioner and each patient's circumstances. 
Effective doses may be extrapolated from dose-response curves derived from in vitro or 
animal model test systems, 
5 For antibodies, the dosage administered to a patient is typically 0.1 mg/kg to 100 

mg/kg of the patient's body v^eight. Preferably, the dosage administered to a patient is 
betw^een 0.1 mg/kg and 20 mg/kg of the patient's body weight, more preferably I mg/kg to 
10 mg/kg of the patient's body weight. Generally, human antibodies have a longer half-life 
within the human body than antibodies from other species due to the immune response to the 

10 foreign polypeptides. Thus, lower dosages of human antibodies and less frequent 
administration is often possible. Further, the dosage and frequency of administration of 
antibodies of the invention may be reduced by enhancing uptake and tissue penetration (e.g., 
into the brain) of the antibodies by modifications such as, for example, lipidation. 

The invention also provides a pharmaceutical pack or kit comprising one or more 

15 containers filled with one or more of the ingredients of the pharmaceutical compositions of 
the invention. Optionally associated with such container(s) can be a notice in the form 
prescribed by a governmental agency regulating the manufacture, use or sale of 
pharmaceuticals or biological products, which notice reflects approval by the agency of 
manufacture, use or sale for human administration. 

20 

Diagnosis and Imaging 

Labeled antibodies, and derivatives and analogs thereof, which specifically bind to a 
polypeptide of interest can be used for diagnostic purposes to detect, diagnose, or monitor 
diseases, disorders, and/or conditions associated with the aberrant expression and/or activity 

25 of a polypeptide of the invention. The invention provides for the detection of aberrant 
expression of a polypeptide of interest, comprising (a) assaying the expression of the 
polypeptide of interest in cells or body fluid of an individual using one or more antibodies 
specific to the polypeptide interest and (b) comparing the level of gene expression with a 
standard gene expression level, whereby an increase or decrease in the assayed polypeptide 

30 gene expression level compared to the standard expression level is indicative of aberrant 
expression. 
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The invention provides a diagnostic assay for diagnosing a disorder, comprising (a) 
assaying the expression of the polypeptide of interest in cells or body fluid of an individual 
using one or more antibodies specific to the polypeptide interest and (b) comparing the level 
of gene expression with a standard gene expression level, whereby an increase or decrease in 
5 the assayed polypeptide gene expression level compared to the standard expression level is 
indicative of a particular disorder. With respect to cancer, the presence of a relatively high 
amount of transcript in biopsied tissue from an individual may indicate a predisposition for 
the development of the disease, or may provide a means for detecting the disease prior to the 
appearance of actual clinical symptoms. A more definitive diagnosis of this type may allow 

10 health professionals to employ preventative measures or aggressive treatment earUer thereby 
preventing the development or further progression of the cancer. 

Antibodies of the invention can be used to assay protein levels in a biological sample 
using classical immunohistological methods known to those of skill in the art (e.g., see 
Jalkanen, et al., J. Cell. Biol. 101:976-985 (1985); Jalkanen, et al., J. Cell . Biol. 105:3087- 

15 3096 (1987)). Other antibody-based methods useful for detecting protein gene expression 
include immunoassays, such as the enzyme linked immunosorbent assay (ELISA) and the 
radioimmunoassay (RIA). Suitable antibody assay labels are known in the art and include 
enzyme labels, such as, glucose oxidase; radioisotopes, such as iodine (1251, 1211), carbon 
(14C), sulfur (35S), tritium (3H), indium (1 121n), and technetium (99Tc); luminescent labels, 

20 such as luminol; and fluorescent labels, such as fluorescein and rhodamine, and biotin. 

One aspect of the invention is the detection and diagnosis of a disease or disorder 
associated with aberrant expression of a polypeptide of interest in an animal, preferably a 
mammal and most preferably a human. In one embodiment, diagnosis comprises: a) 
administering (for example, parenterally, subcutaneously, or intraperitoneally) to a subject 

25 an effective amount of a labeled molecule which specifically binds to the polypeptide of 
interest; b) waiting for a time interval following the administering for permitting the labeled 
molecule to preferentially concentrate at sites in the subject where the polypeptide is 
expressed (and for unbound labeled molecule to be cleared to background level); c) 
determining background level; and d) detecting the labeled molecule in the subject, such that 

30 detection of labeled molecule above the background level indicates that the subject has a 
particular disease or disorder associated with aberrant expression of the polypeptide of 
interest. Background level can be determined by various methods including, comparing the 
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amount of labeled molecule detected to a standard value previously .determined for a 
particular system. 

It will be understood in the art that the size of the subject and the imaging system 
used will determine the quantity of imaging moiety needed to produce diagnostic images. In 
5 the case of a radioisotope moiety, for a human subject, the quantity of radioactivity injected 
will normally range from about 5 to 20 millicuries of 99mTc. The labeled antibody or 
antibody fragment will then preferentially accumulate at the location of cells which contain 
the specific protein. In vivo tumor imaging is described in S.W. Burchiel et aL, 
"Immunopharmacokinetics of Radiolabeled Antibodies and Their Fragments." (Chapter 13 

10 in Tumor Imaging: The Radiochemical Detection of Cancer, S.W. Burchiel and B. A. 
Rhodes, eds., Masson Publishing Inc. (1982). 

Depending on several variables, including the type of label used and the mode of 
administration, the time interval following the administration for permitting the labeled 
molecule to preferentially concentrate at sites in the subject and for unbound labeled 

15 molecule to be cleared to background level is 6 to 48 hours or 6 to 24 hours or 6 to 12 hours. 
In another embodiment the time interval following administration is 5 to 20 days or 5 to 10 
days. 

In an embodiment, monitoring of the disease or disorder is carried out by repeating 
the method for diagnosing the disease or disease, for example, one month after initial 

20 diagnosis, six months after initial diagnosis, one year after initial diagnosis, etc. 

Presence of the labeled molecule can be detected in the patient using methods known 
in the art for in vivo scanning. These methods depend upon the type of label used. Skilled 
artisans will be able to determine the appropriate method for detecting a particular label. 
Methods and devices that may be used in the diagnostic methods of the invention include, but 

25 are not limited to, computed tomography (CT), whole body scan such as position emission 
tomography (PET), magnetic resonance imaging (MRI), and sonography. 

In a specific embodiment, the molecule is labeled with a radioisotope and is detected 
in the patient using a radiation responsive surgical instrument (Thurston et al., U.S. Patent 
No. 5,441,050). In another embodiment, the molecule is labeled with a fluorescent 

30 compound and is detected in the patient using a fluorescence responsive scanning instrument. 
In another embodiment, the molecule is labeled with a positron emitting metal and is detected 
in the patent using positron emission-tomography. In yet another embodiment, the molecule 
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is labeled with a paramagnetic label and is detected in a patient using magnetic resonance 
imaging (MRl). 

Kits ' 
5 The present invention provides kits that can be used in the above methods. In one 

embodiment, a kit comprises an antibody of the invention, preferably a purified antibody, in 
one or more containers. In a specific embodiment, the kits of the present invention contain a 
substantially isolated polypeptide comprising an epitope which is specifically 
immunoreactive with an antibody included in the kit. Preferably, the kits of the present 

10 invention further comprise a control antibody which does not react with the polypeptide of 
interest. In another specific embodiment, the kits of the present invention contain a means 
for detecting the binding of an antibody to a polypeptide of interest (e.g., the antibody may be 
conjugated to a detectable substrate such as a fluorescent compound, an enzymatic substrate, 
a radioactive compound or a luminescent compound, or a second antibody which recognizes 

15 the first antibody may be conjugated to a detectable substrate). 

In another specific embodiment of the present invention, the kit is a diagnostic kit for 
use in screening serum containing antibodies specific against proliferative and/or cancerous 
polynucleotides and polypeptides. Such a kit may include a control antibody that does not 
react with the polypeptide of interest. Such a kit may include a substantially isolated 

20 polypeptide antigen comprising an epitope which is specifically immunoreactive with at least 
one anti-poly peptide antigen antibody. Further, such a kit includes means for detecting the 
binding of said antibody to the antigen (e.g., the antibody may be conjugated to a fluorescent 
compound such as fluorescein or rhodamine which can be detected by flow cytometry). In 
specific embodiments, the kit may include a recombinantly produced or chemically 

25 synthesized polypeptide antigen. The polypeptide antigen of the kit may also be attached to a 
solid support. 

In a more specific embodiment the detecting means of the above-described kit 
includes a solid support to which said polypeptide antigen is attached. Such a kit may also 
include a non-attached reporter-labeled anti-human antibody. In this embodiment, binding of 
30 the antibody to the polypeptide antigen can be detected by binding of the said reporter- 
labeled antibody. 
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In an additional embodiment, the invention includes a diagnostic kit for use in 
screening serum containing antigens of the polypeptide of the invention. The diagnostic kit 
includes a substantially isolated antibody specifically immunoreactive with polypeptide or 
polynucleotide antigens, and means for detecting the binding of the polynucleotide or 
5 polypeptide antigen to the antibody. In one embodiment, the antibody is attached to a solid 
support. In a specific embodiment, the antibody may be a monoclonal antibody. The 
detecting means of the kit may include a second, labeled monoclonal antibody. 
Alternatively, or in addition, the detecting means may include a labeled, competing antigen. 
In one diagnostic configuration, test serum is reacted with a solid phase reagent 

10 having a surface-bound antigen obtained by the methods of the present invention. After 
binding with specific antigen antibody to the reagent and removing unbound serum 
components by washing, the reagent is reacted with reporter-labeled anti-human antibody to 
bind reporter to the reagent in proportion to the amount of bound anti-antigen antibody on the 
solid support. The reagent is again washed to remove unbound labeled antibody, and the 

15 amount of reporter associated with the reagent is determined. Typically, the reporter is an 
enzyme which is detected by incubating the solid phase in the presence of a suitable 
fluorometric, luminescent or colorimetric substrate (Sigma, St. Louis, MO). 

The solid surface reagent in the above assay is prepared by known techniques for 
attaching protein material to solid support material, such as polymeric beads, dip sticks, 96- 

20 well plate or filter material. These attachment methods generally include non-specific 
adsorption of the protein to the support or covalent attachment of the protein, typically 
through a free amine group, to a chemically reactive group on the solid support, such as an 
activated carboxyl, hydroxyl, or aldehyde group. Alternatively, streptavidin coated plates can 
be used in conjunction with biotinylated antigen(s). 

25 Thus, the invention provides an assay system or kit for carrying out this diagnostic 

method. The kit generally includes a support with surface- bound recombinant antigens, and 
a reporter-labeled anti-human antibody for detecting surface-bound anti-antigen antibody. 

Uses of the Polynucleotides 

30 Each of the polynucleotides identified herein can be used in numerous ways as 

reagents. The following description should be considered exemplary and utilizes known 
techniques. 



wo 00/55173 



PCT/USOO/05881 



231 

The breast/ovarian cancer antigen polynucleotides of the present invention are useful 
for chromosome identification. There exists an ongoing need to identify new chromosome 
markers, since few chromosome marking reagents, based on actual sequence data (repeat 
polymorphisms), are presently available. Each sequence is specifically targeted to and can 
5 hybridize with a particular location on an individual human chromosome, thus each 
polynucleotide of the present invention can routinely be used as a chromosome marker using 
techniques known in the art. 

Briefly, sequences can be mapped to chromosomes by preparing PCR primers 
(preferably at least 15 bp (e.g., 15-25 bp) from the sequences shown in SEQ ID NO:X, or the 

10 complement thereto. Primers can optionally be selected using computer analysis so that 
primers do not span more than one predicted exon in the genomic DNA. These primers are 
then used for PCR screening of somatic cell hybrids containing individual human 
chromosomes. Only those hybrids containing the human gene corresponding to SEQ ID 
NO:X will yield an amplified fragment. 

15 Similarly, somatic hybrids provide a rapid method of PCR mapping the 

polynucleotides to particular chromosomes. Three or more clones can be assigned per day 
using a single thermal cycler. Moreover, sublocalization of the polynucleotides can be 
achieved with panels of specific chromosome fragments. Other gene mapping strategies that 
can be used include in situ hybridization, prescreening with labeled flow-sorted 

20 chromosomes, preselection by hybridization to construct chromosome specific-cDNA 
libraries, and computer mapping techniques (See, e.g., Shuler, Trends Biotechnol 16:456-459 
(1998) which is hereby incorporated by reference in its entirety). 

Precise chromosomal location of the polynucleotides can also be achieved using 
fluorescence in situ hybridization (FISH) of a metaphase chromosomal spread. This 

25 technique uses polynucleotides as short as 500 or 600 bases; however, polynucleotides 2,000- 
4,000 bp are preferred. For a review of this technique, see Verma et al., "Human 
Chromosomes: a Manual of Basic Techniques," Pergamon Press, New York (1988). 

For chromosome mapping, the polynucleotides can be used individually (to mark a 
single chromosome or a single site on that chromosome) or in panels (for marking multiple 

30 sites and/or multiple chromosomes). 
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Thus, the present invention also provides a method for chromosomal localization 
which involves (a) preparing PGR primers from the polynucleotide sequences in Table 3 and 
SEQ ID NO:X and (b) screening somatic cell hybrids containing individual chromosomes. 

The polynucleotides of the present invention would likewise be useful for radiation 
5 hybrid mapping, HAPPY mapping, and long range restriction mapping. For a review of these 
techniques and others known in the art, see, e.g. Dear, "Genome Mapping: A Practical 
Approach," IRL Press at Oxford University Press, London (1997); Aydin, J. Mol. Med. 
77:691-694 (1999); Hacia et al, Mol. Psychiatry 3:483-492 (1998); Herrick et al.. 
Chromosome Res. 7:409-423 (1999); Hamilton et al., Methods Cell Biol. 62:265-280 (2000); 
10 and/or Ott, J. Hered. 90:68-70 (1999) each of which is hereby incorporated by reference in its 
entirety. 

Once a polynucleotide has been mapped to a precise chromosomal location, the 
physical position of the polynucleotide can be used in linkage analysis. Linkage analysis 
establishes coinheritance between a chromosomal location and presentation of a particular 

15 disease. (Disease mapping data are found, for example, in V. McKusick, Mendelian 
Inheritance in Man (available on line through Johns Hopkins University Welch Medical 
Library).) Assuming 1 megabase mapping resolution and one gene per 20 kb, a cDNA 
precisely localized to a chromosomal region associated with the disease could be one of 50- 
500 potential causative genes. 

20 Thus, once coinheritance is established, differences in a polynucleotide of the 

invention and the corresponding gene between affected and unaffected individuals can be 
examined. First, visible structural alterations in the chromosomes, such as deletions or 
translocations, are examined in chromosome spreads or by PCR. If no structural alterations 
exist, the presence of point mutations are ascertained. Mutations observed in some or all 

25 affected individuals, but not in normal individuals, indicates that the mutation may cause the 
disease. However, complete sequencing of the polypeptide and the corresponding gene from 
several normal individuals is required to distinguish the mutation from a polymorphism. If a 
new polymorphism is identified, this polymorphic polypeptide can be used for further linkage 
analysis. 

30 Furthermore, increased or decreased expression of the gene in affected individuals as 

compared to unaffected individuals can be assessed using the polynucleotides of the 
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invention. Any of these alterations (altered expression, chromosomal rearrangement, or 
mutation) can be used as a diagnostic or prognostic marker. 

Thus, the invention provides a method of detecting increased or decreased expression 
levels of the breast, ovarian, breast cancer and/or ovarian cancer polynucleotides in affected 
5 individuals as compared to unaffected individuals using polynucleotides of the present 
invention and techniques known in the art, including but not limited to the method described 
in Example 1 1. Any of these alterations (altered expression, chromosomal rearrangement, or 
mutation) can be used as a diagnostic or prognostic marker. 

Thus, the invention also provides a diagnostic method useful during diagnosis of a 

10 disorder related to the female reproductive system, particularly a disorder related to the breast 
and/or ovary, including breast cancer and/or ovarian cancer, involving measuring the 
expression level of breast/ovarian cancer antigen polynucleotides in breast and/or ovarian 
tissue or other cells or body fluid from an individual and comparing the measured gene 
expression level with a standard breast, ovarian, breast cancer and/or ovarian cancer 

15 polynucleotide expression level, whereby an increase or decrease in the gene expression level 
compared to the standard is indicative of a disorder related to the female reproductive system, 
particularly a disorder related to the breast and/or ovary, including breast cancer and/or 
ovarian cancer. 

In still another embodiment, the invention includes a kit for analyzing samples for the 
20 presence of proliferative and/or cancerous polynucleotides derived from a test subject. In a 
general embodiment, the kit includes at least one polynucleotide probe containing a 
nucleotide sequence that will specifically hybridize with a polynucleotide of the invention 
and a suitable container. In a specific embodiment, the kit includes two polynucleotide 
probes defining an internal region of the polynucleotide of the invention, where each probe 
25 has one strand containing a 3 1 'mer-end internal to the region. In a further embodiment, the 
probes may be useful as primers for polymerase chain reaction amplification. 

Where a diagnosis of a a disorder related to the female reproductive system, 
particularly a disorder related to the breast and/or ovary, including, for example, diagnosis of 
a tumor, has already been made according to conventional methods, the present invention is 
30 useful as a prognostic indicator, whereby patients exhibiting enhanced or depressed breast, 
ovarian, breast cancer and/or ovarian cancer polynucleotide expression will experience a 



wo 00/55173 



PCT/USOO/05881 



234 

worse clinical outcome relative to patients expressing the gene at a level nearer the standard 
level. 

By "measuring the expression level of breast, ovarian, breast cancer and/or ovarian 
cancer polynucleotides" is intended qualitatively or quantitatively measuring or estimating 
5 the level of the breast, ovarian, breast cancer and/or ovarian cancer polypeptide or the level 
of the mRNA encoding the breast, ovarian, breast cancer and/or ovarian cancer polypeptide 
in a first biological sample either directly (e.g., by determining or estimating absolute protein 
level or mRNA level) or relatively (e.g., by comparing to the breast, ovarian, breast cancer 
and/or ovarian cancer polypeptide level or mRNA level in a second biological sample). 

10 Preferably, the breast, ovarian, breast cancer and/or ovarian cancer polypeptide level or 
mRNA level in the first biological sample is measured or estimated and compared to a 
standard breast, ovarian, breast cancer and/or ovarian cancer polypeptide level or mRNA 
level, the standard being taken from a second biological sample obtained from an individual 
not having the female reproductive system related disorder or being determined by averaging 

15 levels from a population of individuals not having a female reproductive system related 
disorder. As will be appreciated in the art, once a standard breast, ovarian, breast cancer 
and/or ovarian cancer polypeptide level or mRNA level is known, it can be used repeatedly 
as a standard for comparison. 

By "biological sample" is intended any biological sample obtained from an 

20 individual, body fluid, cell line, tissue culture, or other source which contains breast, ovarian, 
breast cancer and/or ovarian cancer polypeptide or the corresponding mRNA. As indicated, 
biological samples include body fluids (such as vaginal pool, breast milk, lymph, sera, 
plasma, urine, semen, synovial fluid and spinal fluid) which contain the breast, ovarian, 
breast cancer and/or ovarian cancer polypeptide, breast and/or ovarian tissue, and other tissue 

25 sources found to express the breast, ovarian, breast cancer and/or ovarian cancer polypeptide. 
Methods for obtaining tissue biopsies and body fluids from mammals are well known in the 
art. Where the biological sample is to include mRNA, a tissue biopsy is the preferred source. 

The method(s) provided above may preferrably be applied in a diagnostic method 
and/or kits in which polynucleotides and/or polypeptides of the invention are attached to a 

30 solid support. In one exemplary method, the support may be a "gene chip" or a "biological 
chip" as described in US Patents 5,837,832, 5,874,219, and 5,856,174. Further, such a gene 
chip with breast, ovarian, breast cancer and/or ovarian cancer polynucleotides attached may 
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be used to identify polymorphisms between the breast, ovarian, breast cancer and/or ovarian 
cancer polynucleotide sequences, with polynucleotides isolated from a test subject. The 
knowledge of such polymorphisms (i.e. their location, as well as, their existence) would be 
beneficial in identifying disease loci for many disorders, such as for example, in neural 
5 disorders, immune system disorders, muscular disorders, reproductive disorders, 
gastrointestinal disorders, pulmonary disorders, cardiovascular disorders, renal disorders, 
proliferative disorders, and/or cancerous diseases and conditions, though most preferably in 
breast and/or ovarian related proliferative, and/or cancerous diseases and conditions. Such a 
method is described in US Patents 5,858,659 and 5,856,104. The US Patents referenced 

10 supra are hereby incorporated by reference in their entirety herein. 

The present invention encompasses breast, ovarian, breast cancer and/or ovarian 
cancer polynucleotides that are chemically synthesized, or reproduced as peptide nucleic 
acids (PNA), or according to other methods known in the art. The use of PNAs would serve 
as the preferred form if the polynucleotides of the invention are incorporated onto a solid 

15 support, or gene chip. For the purposes of the present invention, a peptide nucleic acid (PNA) 
is a polyamide type of DNA analog and the monomeric units for adenine, guanine,, thy mine 
and cytosine are available commercially (Perceptive Biosystems). Certain components of 
DNA, such as phosphorus, phosphorus oxides, or deoxyribose derivatives, are not present in 
PNAs. As disclosed by P. E. Nielsen, M. Egholm, R. H. Berg and O. Buchardt, Science 254, 

20 1497 (1991); and M. Egholm, O. Buchardt, L.Christensen, C. Behrens, S. M. Freier, D. A. 
Driver, R. H. Berg, S. K. Kim, B. Norden, and P. E. Nielsen, Nature 365, 666 (1993), PNAs 
bind specifically and tightly to complementary DNA strands and are not degraded by 
nucleases. In fact, PNA binds more strongly to DNA than DNA itself does. This is probably 
because there is no electrostatic repulsion between the two strands, and also the polyamide 

25 backbone is more flexible. Because of this, PNA/DNA duplexes bind under a wider range of 
stringency conditions than DNA/DNA duplexes, making it easier to perform multiplex 
hybridization. Smaller probes can be used than with DNA due to the strong binding. In 
addition, it is more likely that single base mismatches can be determined with PNA/DNA 
hybridization because a single mismatch in a PNA/DNA l5-mer lowers the melting point 

30 (T.sub.m) by 8^-20° C, vs. 4°-16'' C for the DNA/DNA l5-mer duplex. Also, the absence of 
charge groups in PNA means that hybridization can be done at low ionic strengths and reduce 
possible interference by salt during the analysis. 



wo 00/55173 



PCT/USOO/05881 



236 

The present invention have uses which include, but are not limited to, detecting 
cancer in mammals. In particular the invention is useful during diagnosis of pathological cell 
proliferative neoplasias which include, but are not limited to: acute myelogenous leukemias 
including acute monocytic leukemia, acute myeloblastic leukemia, acute promyelocytic 
5 leukemia, acute myelomonocytic leukemia, acute erythroleukemia, acute megakaryocytic 
leukemia, and acute undifferentiated leukemia, etc.; and chronic myelogenous leukemias 
including chronic myelomonocytic leukemia, chronic granulocytic leukemia, etc. Preferred 
mammals include monkeys, apes, cats, dogs, cows, pigs, horses, rabbits and humans. 
Particularly preferred are humans. 

10 Pathological cell proliferative disorders are often associated with inappropriate 

activation of proto-oncogenes. (Gelmann, E. P. et al., 'The Etiology of Acute Leukemia: 
Molecular Genetics and Viral Oncology," in Neoplastic Diseases of the Blood, Vol 1., 
Wiemik, P. H, et al. eds,, 161-182 (1985)). Neoplasias are now believed to result from the 
qualitative alteration of a normal cellular gene product, or from the quantitative modification 

15 of gene expression by insertion into the chromosome of a viral sequence, by chromosomal 
translocation of a gene to a more actively transcribed region, or by some other mechanism. 
(Gelmann et al., supra) It is likely that mutated or altered expression of specific genes is 
involved in the pathogenesis of some leukemias, among other tissues and cell types. 
(Gelmann et al., supra) Indeed, the human counterparts of the oncogenes involved in some 

20 animal neoplasias have been amplified or translocated in some cases of human leukemia and 
carcinoma. (Gelmann et al., supra) 

For example, c-myc expression is highly amplified in the non-lymphocytic leukemia 
cell line HL-60. When HL-60 cells are chemically induced to stop proliferation, the level of 
c-myc is found to be downregulated. (International Publication Number WO 91/15580). 

25 However, it has been shown that exposure of HL-60 cells to a DNA construct that is 
complementary to the 5* end of c-myc or c-myb blocks translation of the corresponding 
mRNAs which downregulates expression of the c-myc or c-myb proteins and causes arrest of 
cell proliferation and differentiation of the treated cells. (International Publication Number 
WO 91/15580; Wickstrom et al., Proc. Natl. Acad. Sci. 85:1028 (1988); Anfossi et al., Proc. 

30 Natl. Acad. Sci. 86:3379 (1989)). However, the skilled artisan would appreciate the present 
invention's usefulness is not limited to treatment of proliferative disorders of hematopoietic 
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cells and tissues, in light of the numerous cells and cell types of varying origins which are 
known to exhibit proliferative phenotypes. 

In addition to the foregoing, a breast/ovarian cancer antigen polynucleotide can be 
used to control gene expression through triple helix formation or through antisense DNA or 
5 RNA. Antisense techniques are discussed, for example, in Okano, J. Neurochem. 56; 560 
(1991); "Oligodeoxynucleotides as Antisense Inhibitors of Gene Expression, CRC Press, 
Boca Raton, FL (1988). Triple helix formation is discussed in, for instance Lee et al., Nucleic 
Acids Research 6: 3073 (1979); Cooney et al., Science 241: 456 (1988); and Dervan et al., 
Science 251: 1360 (1991). Both methods rely on binding of the polynucleotide to a 
10 complementary DNA or RNA. For these techniques, preferred polynucleotides are usually 
oligonucleotides 20 to 40 bases in length and complementary to either the region of the gene 
involved in transcription (triple helix - see Lee et al., Nucl. Acids Res. 6:3073 (1979); 
Cooney et al., Science 241 :456 (1988); and Dervan et al., Science 251:1360 (1991) ) or to the 
mRNA itself (antisense - Okano, J. Neurochem. 56:560 (1991); Oligodeoxy-nucleotides as 

15 Antisense Inhibitors of Gene Expression, CRC Press, Boca Raton, FL (1988).) Triple helix 
formation optimally results in a shut-off of RNA transcription from DNA, while antisense 
RNA hybridization blocks translation of an mRNA molecule into polypeptide. The 
oligonucleotide described above can also be delivered to cells such that the antisense RNA or 
DNA may be expressed in vivo to inhibit production of polypeptide of the present invention 

20 antigens. Both techniques are effective in model systems, and the information disclosed 
herein can be used to design antisense or triple helix polynucleotides in an effort to treat 
disease, and in particular, for the treatment of proliferative diseases and/or conditions. 

Polynucleotides of the present invention are also useful in gene therapy. One goal of 
gene therapy is to insert a normal gene into an organism having a defective gene, in an effort 

25 to correct the genetic defect. The polynucleotides disclosed in the present invention offer a 
means of targeting such genetic defects in a highly accurate manner. Another goal is to insert 
a new gene that was not present in the host genome, thereby producing a new trait in the host 
cell. 

The polynucleotides are also useful for identifying individuals from minute biological 
30 samples. The United States military, for example, is considering the use of restriction 
fragment length polymorphism (RFLP) for identification of its personnel. In this technique, 
an individual's genomic DNA is digested with one or more restriction enzymes, and probed 
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on a Southern blot to yield unique bands for identifying personnel. This method does not 
suffer from the current limitations of "Dog Tags" which can be lost, switched, or stolen, 
making positive identification difficult. The polynucleotides of the present invention can be 
used as additional DNA markers for RFLP. 
5 The polynucleotides of the present invention can also be used as an alternative to 

RFLP, by determining the actual base-by-base DNA sequence of selected portions of an 
individual's genome. These sequences can be used to prepare PGR primers for amplifying 
and isolating such selected DNA, which can then be sequenced. Using this technique, 
individuals can be identified because each individual will have a unique set of DNA 
10 sequences. Once an unique ID database is established for an individual, positive 
identification of that individual, living or dead, can be made from extremely small tissue 
samples. 

Forensic biology also benefits from using DNA-based identification techniques as 
disclosed herein. DNA sequences taken from very small biological samples such as tissues, 

15 e.g., hair or skin, or body fluids, e.g., blood, saliva, semen, synovial fluid, amniotic fluid, 
breast milk, lymph, pulmonary sputum or surfactant, urine, fecal matter, etc., can be 
amplified using PGR. In one prior art technique, gene sequences amplified from 
polymorphic loci, such as DQa class II HLA gene, are used in forensic biology to identify 
individuals. (ErHch, H., PGR Technology, Freeman and Go. (1992).) Once these specific 

20 polymorphic loci are amplified, they are digested with one or more restriction enzymes, 
yielding an identifying set of bands on a Southern blot probed with DNA corresponding to 
the DQa class 11 HLA gene. Similarly, polynucleotides of the present invention can be used 
as polymorphic markers for forensic purposes. 

There is also a need for reagents capable of identifying the source of a particular 

25 tissue. Such need arises, for example, in forensics when presented with tissue of unknown 
origin. Appropriate reagents can comprise, for example, DNA probes or primers specific to 
breast, ovarian, breast cancer and/or ovarian cancer polynucleotides prepared from the 
sequences of the present invention. Panels of such reagents can identify tissue by species 
and/or by organ type. In a similar fashion, these reagents can be used to screen tissue 

30 cultures for contamination. 

The polynucleotides of the present invention are also useful as hybridization probes 
for differential identification of the tissue(s) or cell type(s) present in a biological sample. 
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Similarly, polypeptides and antibodies directed to polypeptides of the present invention are 
useful to provide immunological probes for differential identification of the tissue(s) (e.g., 
immunohistochemistry assays) or cell type(s) (e.g., immunocytochemistry assays). In 
addition, for a number of disorders of the above tissues or cells, significantly higher or lower 
5 levels of gene expression of the polynucleotides/polypeptides of the present invention may be 
detected in certain tissues (e.g., tissues expressing polypeptides and/or polynucleotides of the 
present invention, breast, ovarian, breast cancer and/or ovarian cancer tissues and/or 
cancerous and/or wounded tissues) or bodily fluids (e.g., vaginal pool, breast milk, serum, 
plasma, urine, synovial fluid or spinal fluid) taken from an individual having such a disorder, 

10 relative to a "standard" gene expression level, i.e., the expression level in healthy tissue from 
an individual not having the disorder. 

Thus, the invention provides a diagnostic method of a disorder, which involves: (a) 
assaying gene expression level in cells or body fluid of an individual; (b) comparing the gene 
expression level with a standard gene expression level, whereby an increase or decrease in 

15 the assayed gene expression level compared to the standard expression level is indicative of a 
disorder. 

In the very least, the polynucleotides of the present invention can be used as 
molecular weight markers on Southern gels, as diagnostic probes for the presence of a 
specific mRNA in a particular cell type, as a probe to "subtract-out" known sequences in the 
20 process of discovering novel polynucleotides, for selecting and making oligomers for 
attachment to a "gene chip" or other support, to raise anti-DNA antibodies using DNA 
immunization techniques, and as an antigen to elicit an immune response. 



Uses of the Polypeptides 
25 Each of the polypeptides identified herein can be used in numerous ways. The 

following description should be considered exemplary and utilizes known techniques. 

Polypeptides and antibodies directed to polypeptides of the present invention are 

useful to provide immunological probes for differential identification of the tissue(s) (e.g., 

immunohistochemistry assays such as, for example, ABC immunoperoxidase (Hsu et aL, J. 
30 Histochem. Cytochem. 29:577-580 (1981)) or cell type(s) (e.g., immunocytochemistry 

assays). 
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Antibodies can be used to assay levels of polypeptides encoded by polynucleotides of 
the invention in a biological sample using classical immunohistological methods known to 
those of skill in the art (e.g., see Jalkanen, et al., J. Cell. Biol. 101 :976-985 (1985); Jalkanen, 
et aL, J. Cell. Biol. 105:3087-3096 (1987)). Other antibody-based methods useful for 
5 detecting protein gene expression include immunoassays, such as the enzyme linked 
immunosorbent assay (ELISA) and the radioimmunoassay (RJA). Suitable antibody assay 
labels are known in the art and include enzyme labels, such as, glucose oxidase; 
radioisotopes, such as iodine ('^'l, '""1, '"^I, ^^^1), carbon ('^C), sulftir (^'S), tritium (^H), 
indium ("""In, "^'"In, ""In, '"in), and technetium (^^Tc, ^^'"Tc), thallium (^°'Ti), gallium 

10 (^^Ga, ^^'Ga), palladium ('^^Pd), molybdenum CMo\ xenon ('^^Xe), fluorine (*^F), *'^Sm, 
'"Lu, '-^^Gd, '^^Pm, '^Ya, '^^Yb, '^^Ho, ^^Y, '^^Sc, '^^Re, '^^Re, '''Pr, »^^Rh, ^^Ru; 
luminescent labels, such as luminol; and fluorescent labels, such as fluorescein and 
rhodamine, and biotin. 

In addition to. assaying levels of polypeptide of the present invention in a biological 

15 sample, proteins can also be detected in vivo by imaging. Antibody labels or markers for in 
vivo imaging of protein include those detectable by X-radiography, NMR or ESR. For X- 
radiography, suitable labels include radioisotopes such as barium or cesium, which emit 
detectable radiation but are not overtly harmful to the subject. Suitable markers for NMR 
and ESR include those with a detectable characteristic spin, such as deuterium, which may be 

20 incorporated into the antibody by labeling of nutrients for the relevant hybridoma. 

A protein-specific antibody or antibody fragment which has been labeled with an 
appropriate detectable imaging moiety, such as a radioisotope (for example, "^In, ^^"*Tc, 

^UIj^ 125,^ 123j^ 121j^^ ^^^^^^ (14^^ g^,^^ ^35s)^ ^^j^j^^^ (3pj^^ .^jj^^^ ^USmj^^ ll3mj^^ 112^^^ lllj^^^ 

and technetium (^^Tc, ^^"'Tc), thallium (^"'Ti), gallium (^**Ga, *^^Ga), palladium (^^^Pd), 
25 molybdenum (^^Mo), xenon (^^^Xe), fluorine ('^F, '^^Sm, '^^Lu, '^^Gd, *>m, *^*^La, ^^^Yb, 
*^^Ho, ^°Y, ^^Sc, '^^Re, ^^^^Re, '^^Pr, **^^Rh, ^^Ru), a radio-opaque substance, or a material 
detectable by nuclear magnetic resonance, is introduced (for example, parenterally, 
subcutaneously or intraperitoneally) into the mammal to be examined for immune system 
disorder. It will be understood in the art that the size of the subject and the imaging system 
30 used will determine the quantity of imaging moiety needed to produce diagnostic images. In 
the case of a radioisotope moiety, for a human subject, the quantity of radioactivity injected 
will , normally range from about 5 to 20 millicuries of ^^'"Tc. The labeled antibody or 



wo 00/55173 PCT/USOO/05881 

241 

antibody fragment will then preferentially accumulate at the location of cells which express 
the polypeptide encoded by a polynucleotide of the invention, fn vivo tumor imaging is 
described in S.W. Burchiel et al., "Immunopharmacokinetics of Radiolabeled Antibodies and 
Their Fragments" (Chapter 13 in Tumor Imaging: The Radiochemical Detection of Cancer, 
5 S.W. Burchiel and B. A. Rliodes, eds., Masson Publishing Inc. (1982)). 

In one embodiment, the invention provides a method for the specific delivery of 
compositions of the invention to cells by administering polypeptides of the invention (e.g., 
polypeptides encoded by polynucleotides of the invention and/or antibodies) that are 
associated with heterologous polypeptides or nucleic acids. In one example, the invention 

10 provides a method for delivering a therapeutic protein into the targeted cell. In another 
example, the invention provides a method for delivering a single stranded nucleic acid (e.g., 
antisense or ribozymes) or double stranded nucleic acid (e.g., DNA that can integrate into the 
cell's genome or replicate episomally and that can be transcribed) into the targeted cell. 

In another embodiment, the invention provides a method for the specific destruction 

1 5 of cells (e.g., the destruction of tumor cells) by administering polypeptides of the invention in 
association with toxins or cytotoxic prodrugs. 

By "toxin" is meant one or more compounds that bind and activate endogenous 
cytotoxic effector systems, radioisotopes, holotoxins, modified toxins, catalytic subunits of 
toxins, or any molecules or enzymes not normally present in or on the surface of a cell that 

20 under defined conditions cause the cell's death. Toxins that may be used according to the 
methods of the invention include, but are not limited to, radioisotopes known in the art, 
compounds such as, for example, antibodies (or complement fixing containing portions 
thereof) that bind an inherent or induced endogenous cytotoxic effector system, thymidine 
kinase, endonuclease, RNAse, alpha toxin, ricin, abrin, Pseudomonas exotoxin A, diphtheria 

25 toxin, saporin, momordin, gelonin, pokeweed antiviral protein, alpha-sarcin and cholera 
toxin. "Toxin" also includes a cytostatic or cytocidal agent, a therapeutic agent or a 
radioactive metal ion, e.g., alpha-emitters such as, for example, ^'^Bi, or other radioisotopes 
such as, for example, '^'^Pd, '^^Xe, '^'l, ^^Ge, ^^Co, ^'Zn, «^Sr, ^^P, ''Y, *^^Sm, '"Gd, 
'*^Vb, ^'Cr, ^^Mn, '^Se, "^Sn, ''Vttrium, '^^Tin, ^^^Rhenium, '^^Holmium, and *^^Rhenium; 

30 luminescent labels, such as luminol; and fluorescent labels, such as fluorescein and 
rhodamine, and biotin. 
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Techniques known in the art may be apphed to label polypeptides of the invention 
(including antibodies). Such techniques include, but are not limited to, the use of 
bifunctional conjugating agents (see e.g., U.S. Patent Nos. 5,756,065; 5,714,631; 5,696,239; 
5,652,361; 5,505,931; 5,489,425; 5,435,990; 5,428,139; 5,342,604; 5,274,119; 4,994,560; 
5 and 5,808,003; the contents of each of which are hereby incorporated by reference in its 
entirety). 

Thus, the invention provides a diagnostic method of a disorder, which involves (a) 
assaying the expression level of a breast, ovarian, breast cancer and/or ovarian cancer 
polypeptide of the present invention in cells or body fluid of an individual, or more 

10 preferrably, assaying the expression level of a breast, ovarian, breast cancer and/or ovarian 
cancer of the present invention in breast and/or ovarian cells or vaginal pool or breast milk of 
an individual; and (b) comparing the assayed polypeptide expression level with a standard 
polypeptide expression level, whereby an increase or decrease in the assayed polypeptide 
expression level compared to the standard expression level is indicative of a disorder. With 

15 respect to cancer, the presence of a relatively high amount of transcript in biopsied tissue 
from an individual may indicate a predisposition for the development of the disease, or may 
provide a means for detecting the disease prior to the appearance of actual clinical symptoms. 
A more definitive diagnosis of this type may allow health professionals to employ 
preventative measures or aggressive treatment earlier thereby preventing the development or 

20 further progression of the cancer. 

Moreover, breast/ovarian cancer antigen polypeptides of the present invention can be 
used to treat or prevent diseases or conditions such as, for example, neural disorders, immune 
system disorders, muscular disorders, reproductive disorders, gastrointestinal disorders, 
pulmonary disorders, cardiovascular disorders, renal disorders, proliferative disorders, and/or 

25 cancerous diseases and conditions, preferably proliferative disorders of the breast and/or 
ovary, and/or cancerous disease and conditions. For example, patients can be administered a 
polypeptide of the present invention in an effort to replace absent or decreased levels of the 
polypeptide (e.g., insulin), to supplement absent or decreased levels of a different polypeptide 
(e.g., hemoglobin S for hemoglobin B, SOD, catalase, DNA repair proteins), to inhibit the 

30 activity of a polypeptide (e.g., an oncogene or tumor supressor), to activate the activity of a 
polypeptide (e.g., by binding to a receptor), to reduce the activity of a membrane bound 
receptor by competing with it for free ligand (e.g., soluble TNF receptors used in reducing 
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10 



15 



inflammation), or to bring about a desired response (e.g., blood vessel growth inhibition, 
enhancement of the immune response to proUferative cells or tissues). 

Similarly, antibodies directed to a polypeptide of the present invention can also be 
used to treat disease (as described supra, and elsewhere herein). For example, administration 
of an antibody directed to a polypeptide of the present invention can bind, and/or neutralize 
the polypeptide, and/or reduce overproduction of the polypeptide. Similarly, administration 
of an antibody can activate the polypeptide, such as by binding to a polypeptide bound to a 
membrane (receptor). 

At the very least, the polypeptides of the present invention can be used as molecular 
weight markers on SDS-PAGE gels or on molecular sieve gel filtration columns using 
methods well known to those of skill in the art. Polypeptides can also be used to raise 
antibodies, which in turn are used to measure protein expression from a recombinant cell, as 
a way of assessing transformation of the host cell. Moreover, the polypeptides of the present 
invention can be used to test the following biological activities. 



Gene Therapy Methods 

Another aspect of the present invention is to gene therapy methods for treating or 
preventing disorders, diseases and conditions. The gene therapy methods relate to the 
introduction of nucleic acid (DNA, RNA and antisense DNA or RNA) sequences into an 
20 animal to achieve expression of the polypeptide of the present invention. This method 
requires a polynucleotide which codes for a polypeptide of the present invention operatively 
linked to a promoter and any other genetic elements necessary for the expression of the 
polypeptide by the target tissue. Such gene therapy and delivery techniques are known in the 
art, see, for example, WO90/II092, which is herein incorporated by reference. 
25 Thus, for example, cells from a patient may be engineered with a polynucleotide 

(DNA or RNA) comprising a promoter operably linked to a polynucleotide of the present 
invention ex vivo, with the engineered cells then being provided to a patient to be treated 
with the polypeptide of the present invention. Such methods are well-known in the art. For 
example, see Belldegrun, A., et al., J. Natl. Cancer Inst. 85: 207-216 (1993); Ferrantini, M. et 
30 al., Cancer Research 53: 1107-1 112 (1993); Ferrantini, M. et al., J. Immunology 153: 4604- 
4615 (1994); Kaido, T., et al.. Int. J. Cancer 60: 221-229 (1995); Ogura, H., et al., Cancer 
Research 50: 5102-5106 (1990); Santodonato, L., et al., Human Gene Therapy 7:1-10 (1996); 



